Search papers, labs, and topics across Lattice.
The paper introduces CRASH, an LLM-based agent designed to automate the analysis of autonomous vehicle (AV) incident reports from the NHTSA database. CRASH processes both structured and unstructured data to generate summaries, attribute primary causes, and assess AV contribution to incidents. Results show CRASH attributes 64% of incidents to perception or planning failures and identifies rear-end collisions as a prevalent issue, while achieving 86% accuracy when validated by domain experts.
An LLM agent can accurately pinpoint perception and planning failures as the leading causes in over half of real-world autonomous vehicle incidents.
As AVs grow in complexity and diversity, identifying the root causes of operational failures has become increasingly complex. The heterogeneity of system architectures across manufacturers, ranging from end-to-end to modular designs, together with variations in algorithms and integration strategies, limits the standardization of incident investigations and hinders systematic safety analysis. This work examines real-world AV incidents reported in the NHTSA database. We curate a dataset of 2,168 cases reported between 2021 and 2025, representing more than 80 million miles driven. To process this data, we introduce CRASH, Cognitive Reasoning Agent for Safety Hazards, an LLM-based agent that automates reasoning over crash reports by leveraging both standardized fields and unstructured narrative descriptions. CRASH operates on a unified representation of each incident to generate concise summaries, attribute a primary cause, and assess whether the AV materially contributed to the event. Our findings show that (1) CRASH attributes 64% of incidents to perception or planning failures, underscoring the importance of reasoning-based analysis for accurate fault attribution; and (2) approximately 50% of reported incidents involve rear-end collisions, highlighting a persistent and unresolved challenge in autonomous driving deployment. We further validate CRASH with five domain experts, achieving 86% accuracy in attributing AV system failures. Overall, CRASH demonstrates strong potential as a scalable and interpretable tool for automated crash analysis, providing actionable insights to support safety research and the continued development of autonomous driving systems.