Jun 7, 2026 | arXiv:2606.08470

LUNA-AD: Lightweight Uncertainty-Aware Language Model with Lifelong Learning for Autonomous Driving

Ruoyu Yao, Pei Liu, Ruiguo Zhong, Mingxing Peng, Rui Yang, Jun Ma

AI Summary

This paper introduces LUNA-AD, a lightweight uncertainty-aware language model for autonomous driving. It targets three obstacles to using large language models in safety-critical driving: limited reasoning diversity, high computational overhead, and static learning paradigms. Its tri-system architecture combines a multi-agent analytical system that generates uncertainty-aware decision-making demonstrations, a distilled dual-head lightweight model that outputs both decision distributions and textual explanations, and a reflection-driven lifelong learning mechanism that refines candidate decisions and rationales through closed-loop feedback. On nuPlan benchmarks, the authors report state-of-the-art success rates in both non-reactive and reactive modes, with much lower inference latency than existing knowledge-driven autonomous driving frameworks.

Key Contribution

LUNA-AD reports state-of-the-art success rates on nuPlan benchmarks in both non-reactive and reactive modes while reducing inference latency relative to existing knowledge-driven autonomous driving frameworks.

Abstract

While large language models (LLMs) offer promising reasoning capabilities, their integration into safety-critical driving systems is hindered by limited reasoning diversity, high computational overhead, and static learning paradigms. To address these challenges, we propose LUNA-AD, a lightweight uncertainty-aware language model with lifelong learning for autonomous driving (AD). LUNA-AD features a tri-system architecture that reconciles complex multimodal behavioral reasoning, efficient deployment, and continual refinement. We design a multi-agent analytical system to generate uncertainty-aware decision-making demonstrations through diverse hypothesis exploration. A dual-head lightweight heuristic model is distilled to unify the inference of decision distributions and textual explanations while enabling efficient deployment. Furthermore, a reflection-driven lifelong learning mechanism operates on multimodal decision outputs and preserves strategic diversity, allowing for the refinement of candidate decisions and rationales via closed-loop feedback to enhance driving robustness. Extensive experiments on nuPlan benchmarks demonstrate that LUNA-AD achieves state-of-the-art success rates under both non-reactive and reactive modes, with drastically reduced inference latency compared to existing knowledge-driven AD frameworks.
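The abstract describes a model head that outputs a distribution over decisions rather than a single choice. The following is a minimal sketch of how such a distribution and a simple uncertainty score could be computed from raw scores. It is not the authors' implementation: the action names, the example logits, the `summarize_decision` helper, and the use of normalized Shannon entropy as the uncertainty measure are all assumptions made for illustration.

```python
import math

def softmax(logits):
    """Convert raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def summarize_decision(logits, actions):
    """Hypothetical helper: return the decision distribution, the most likely
    action, and entropy normalized to [0, 1] as a rough uncertainty score."""
    probs = softmax(logits)
    max_h = math.log(len(actions))  # entropy of a uniform distribution
    best = max(range(len(actions)), key=lambda i: probs[i])
    return {
        "distribution": dict(zip(actions, probs)),
        "top_action": actions[best],
        "normalized_entropy": entropy(probs) / max_h if max_h > 0 else 0.0,
    }

# Illustrative action set and scores (assumed, not taken from the paper).
ACTIONS = ["keep_lane", "change_left", "change_right", "decelerate"]
result = summarize_decision([2.0, 0.5, 0.1, 1.5], ACTIONS)
uniform = summarize_decision([0.0, 0.0, 0.0, 0.0], ACTIONS)
```

In this sketch, a normalized entropy near 0 means one action dominates, while a value near 1 means the candidates are almost equally likely. A downstream component could, under these assumptions, use that score to decide when several candidate decisions should be kept for further refinement.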

Multimodal Models | Reasoning & Chain-of-Thought | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
