Search papers, labs, and topics across Lattice.
The paper introduces AI-Paging, a network-driven architecture for AI-as-a-Service (AIaaS) that enables intent-to-model matching and execution placement under policy and QoS constraints. It proposes a control-plane transaction to resolve user intents into AI service identities (AISI), session tokens (AIST), and admission leases (COMMIT) for authorized access to AI execution anchors (AEXF). The prototype, built using existing 3GPP mechanisms, demonstrates lease-gated steering and make-before-break anchoring, ensuring AIaaS continuity under dynamic network conditions.
Forget static model selection – AI-Paging lets 6G networks dynamically steer AI service requests to optimal model instances based on real-time conditions and policies.
With AI-as-a-Service (AIaaS) now deployed across multiple providers and model tiers, selecting the appropriate model instance at run time is increasingly outside the end user's knowledge and operational control. Accordingly, the 6G service providers are envisioned to play a crucial role in exposing AIaaS in a setting where users submit only an intent while the network helps in the intent-to-model matching (resolution) and execution placement under policy, trust, and Quality of Service (QoS) constraints. The network role becomes to discover candidate execution endpoints and selects a suitable model/anchor under policy and QoS constraints in a process referred here to as AI-paging (by analogy to cellular call paging). In the proposed architecture, AI-paging is a control-plane transaction that resolves an intent into an AI service identity (AISI), a scoped session token (AIST), and an expiring admission lease (COMMIT) that authorizes user-plane steering to a selected AI execution anchor (AEXF) under a QoS binding. AI-Paging enforces two invariants: (i) lease-gated steering (without COMMIT, no steering state is installed) and (ii) make-before-break anchoring to support continuity and reliability of AIaaS services under dynamic network conditions. We prototype AI-Paging using existing control- and user-plane mechanisms (service-based control, QoS flows, and policy-based steering) with no new packet headers, ensuring compatibility with existing 3GPP-based exposure and management architectures, and evaluate transaction latency, relocation interruption, enforcement correctness under lease expiry, and audit-evidence overhead under mobility and failures.