Search papers, labs, and topics across Lattice.
This paper introduces an active inference framework for UAV swarm trajectory planning, converting the problem into hierarchical probabilistic inference. Expert demonstrations generated by a genetic algorithm are used to learn a probabilistic world model capturing relationships between mission allocation, route ordering, and motion behaviors. Online, the swarm minimizes KL-divergence between posterior beliefs and expert distributions, enabling adaptive mission allocation, route planning, and collision avoidance without re-optimization.
UAV swarms can now adapt to changing conditions and replan trajectories in real-time by inferring expert-like behaviors from a learned probabilistic world model, avoiding computationally expensive re-optimization.
This paper presents an expert-guided active-inference-inspired framework for adaptive UAV swarm trajectory planning. The proposed method converts multi-UAV trajectory design from a repeated combinatorial optimization problem into a hierarchical probabilistic inference problem. In the offline phase, a genetic-algorithm planner with repulsive-force collision avoidance (GA--RF) generates expert demonstrations, which are abstracted into Mission, Route, and Motion dictionaries. These dictionaries are used to learn a probabilistic world model that captures how expert mission allocations induce route orders and how route orders induce motion-level behaviors. During online operation, the UAV swarm evaluates candidate actions by forming posterior beliefs over symbolic states and minimizing KL-divergence-based abnormality indicators with respect to expert-derived reference distributions. This enables mission allocation, route insertion, motion adaptation, and collision-aware replanning without rerunning the offline optimizer. Bayesian state estimators, including EKF and PF modules, are integrated at the motion level to improve trajectory correction under uncertainty. Simulation results show that the proposed framework preserves expert-like planning structure while producing smoother and more stable behavior than modified Q-learning. Additional validation using real-flight UAV trajectory data demonstrates that the learned world model can correct symbolic predictions under noisy and non-smooth observations, supporting its applicability to adaptive UAV swarm autonomy.