Case WesternFeb 26, 2026arXiv:2602.23259

Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving

Jiangxin Sun, Jiangxin Sun, Feng Xue, Feng Xue, Teng Long, Teng Long, Chang Liu, Chang Liu, Jian-Fang Hu, Jian-Fang Hu, Wei-Shi Zheng, Wei-Shi Zheng, Nicu Sebe, N. Sebe

AI Summary

The paper introduces Risk-aware World Model Predictive Control (RaWMPC), a novel end-to-end autonomous driving framework that operates without expert action supervision. RaWMPC uses a world model to predict the outcomes of candidate actions and selects low-risk actions based on explicit risk evaluation, trained using a risk-aware interaction strategy to expose the model to hazardous behaviors. The framework distills risk-avoidance capabilities from the world model into a generative action proposal network via self-evaluation distillation, enabling the generation of low-risk candidate actions at test time.

Key Contribution

End-to-end autonomous driving can ditch expert demonstrations and still achieve state-of-the-art performance, thanks to a risk-aware world model that learns to predict and avoid hazardous outcomes.

Abstract

With advances in imitation learning (IL) and large-scale driving datasets, end-to-end autonomous driving (E2E-AD) has made great progress recently. Currently, IL-based methods have become a mainstream paradigm: models rely on standard driving behaviors given by experts, and learn to minimize the discrepancy between their actions and expert actions. However, this objective of"only driving like the expert"suffers from limited generalization: when encountering rare or unseen long-tail scenarios outside the distribution of expert demonstrations, models tend to produce unsafe decisions in the absence of prior experience. This raises a fundamental question: Can an E2E-AD system make reliable decisions without any expert action supervision? Motivated by this, we propose a unified framework named Risk-aware World Model Predictive Control (RaWMPC) to address this generalization dilemma through robust control, without reliance on expert demonstrations. Practically, RaWMPC leverages a world model to predict the consequences of multiple candidate actions and selects low-risk actions through explicit risk evaluation. To endow the world model with the ability to predict the outcomes of risky driving behaviors, we design a risk-aware interaction strategy that systematically exposes the world model to hazardous behaviors, making catastrophic outcomes predictable and thus avoidable. Furthermore, to generate low-risk candidate actions at test time, we introduce a self-evaluation distillation method to distill riskavoidance capabilities from the well-trained world model into a generative action proposal network without any expert demonstration. Extensive experiments show that RaWMPC outperforms state-of-the-art methods in both in-distribution and out-of-distribution scenarios, while providing superior decision interpretability.

Robotics & Embodied AI World Models & Planning

Citation Metrics

Citations0

Influential citations0

References98

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving

Related Papers