This paper introduces AIF-Router, an Active Inference-based routing framework for orchestrating AI services across heterogeneous cloud-edge environments. Using Bayesian state inference and expected free energy minimization over real-time observability metrics, AIF-Router adapts its routing decisions to dynamic workloads and infrastructure variability without offline training. The framework learns online to balance latency, throughput, and resource utilization, and its learning behavior remains stable despite device instability on edge nodes.
AIF-Router exhibits stable online learning for adaptive AI service orchestration despite device instability on edge nodes, demonstrating that Active Inference is feasible in unreliable edge environments.
Edge computing enables AI inference closer to data sources, reducing latency and bandwidth costs. However, orchestrating AI services across the cloud-edge continuum remains challenging due to dynamic workloads and infrastructure variability. We present AIF-Router, an Active Inference-based routing framework that autonomously learns to balance latency, throughput, and resource utilization across multi-tier AI services without offline training. AIF-Router performs Bayesian state inference and expected free energy minimization to guide routing decisions based on observability-driven real-time metrics. Despite device instability on edge nodes, AIF-Router exhibits stable online learning behavior and demonstrates the feasibility of applying Active Inference for adaptive AI service orchestration in unreliable edge environments. Our findings highlight both the promise and practical challenges of deploying self-adaptive decision-making frameworks for real-world edge AI systems.
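To make the two mechanisms named in the abstract concrete, the following is a minimal sketch of discrete Active Inference applied to a routing choice. It is not the AIF-Router implementation: the state, action, and observation names, the likelihood table, and the preference distribution are all hypothetical values chosen for illustration. The sketch shows a Bayesian belief update over a hidden edge-node state after a latency observation, and route selection by minimizing expected free energy, written here as risk (divergence of predicted observations from preferred ones) plus ambiguity (expected observation entropy).

```python
import math

# Hypothetical discrete model; none of these names or numbers come from the paper.
STATES = ["edge_healthy", "edge_degraded"]   # hidden state of the edge tier
ACTIONS = ["edge", "cloud"]                  # candidate routing targets
OBS = ["low_latency", "high_latency"]        # bucketed latency metric

# LIKELIHOOD[action][state] is an assumed distribution P(obs | state, action).
LIKELIHOOD = {
    "edge": [[0.9, 0.1], [0.2, 0.8]],    # edge latency depends on edge health
    "cloud": [[0.6, 0.4], [0.6, 0.4]],   # cloud latency ignores edge health
}

# Assumed preferred distribution over observations (strongly favors low latency).
PREFERENCES = [0.95, 0.05]


def update_belief(belief, action, obs_idx):
    """Bayesian state inference: posterior is proportional to likelihood times prior."""
    unnorm = [belief[s] * LIKELIHOOD[action][s][obs_idx] for s in range(len(STATES))]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def expected_free_energy(belief, action):
    """Risk (KL from preferred observations) plus ambiguity (expected entropy)."""
    predicted = [
        sum(belief[s] * LIKELIHOOD[action][s][o] for s in range(len(STATES)))
        for o in range(len(OBS))
    ]
    risk = sum(q * math.log(q / p) for q, p in zip(predicted, PREFERENCES) if q > 0)
    ambiguity = sum(
        belief[s] * -sum(p * math.log(p) for p in LIKELIHOOD[action][s] if p > 0)
        for s in range(len(STATES))
    )
    return risk + ambiguity


def choose_route(belief):
    """Pick the action with the lowest expected free energy."""
    return min(ACTIONS, key=lambda a: expected_free_energy(belief, a))


if __name__ == "__main__":
    belief = [0.5, 0.5]                       # uniform prior over edge health
    print(choose_route(belief))
    belief = update_belief(belief, "edge", OBS.index("high_latency"))
    print(belief, choose_route(belief))
```

Under these assumed numbers, a high-latency observation after routing to the edge shifts the belief toward `edge_degraded`, and the next decision moves traffic to the cloud. A real deployment would presumably replace the fixed tables with quantities learned online from observability metrics, which this sketch does not attempt.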