Search papers, labs, and topics across Lattice.
This paper introduces LUNA-AD, a lightweight uncertainty-aware language model designed for autonomous driving, addressing the limitations of existing large language models in safety-critical applications. By employing a tri-system architecture that integrates multimodal behavioral reasoning with lifelong learning, LUNA-AD enhances decision-making diversity and efficiency. Experimental results on nuPlan benchmarks show that LUNA-AD outperforms existing frameworks in success rates while significantly reducing inference latency, marking a substantial advancement in the deployment of AI for autonomous driving.
LUNA-AD achieves state-of-the-art performance in autonomous driving while cutting inference latency, revolutionizing how AI can operate in safety-critical environments.
While large language models (LLMs) offer promising reasoning capabilities, their integration into safety-critical driving systems is hindered by limited reasoning diversity, high computational overhead, and static learning paradigms. To address these challenges, we propose LUNA-AD, a lightweight uncertainty-aware language model with lifelong learning for autonomous driving (AD). LUNA-AD features a tri-system architecture that reconciles complex multimodal behavioral reasoning, efficient deployment, and continual refinement. We design a multi-agent analytical system to generate uncertainty-aware decision-making demonstrations through diverse hypothesis exploration. A dual-head lightweight heuristic model is distilled to unify the inference of decision distributions and textual explanations while enabling efficient deployment. Furthermore, a reflection-driven lifelong learning mechanism operates on multimodal decision outputs and preserves strategic diversity, allowing for the refinement of candidate decisions and rationales via closed-loop feedback to enhance driving robustness. Extensive experiments on nuPlan benchmarks demonstrate that LUNA-AD achieves state-of-the-art success rates under both non-reactive and reactive modes, with drastically reduced inference latency compared to existing knowledge-driven AD frameworks.