Search papers, labs, and topics across Lattice.
This paper details a four-stage pipeline for scenario mining in the Argoverse 2 Challenge, leveraging a Claude Code agent for autonomous code generation and iterative training set screening. The approach employs a Timestamp Balanced Accuracy threshold of 0.8 to curate few-shot examples, followed by a semantic code review and scene-level verification using Qwen3-VL to enhance accuracy. The results demonstrate significant improvements in scenario identification, showcasing the effectiveness of integrating advanced language models in autonomous driving contexts.
Autonomous code generation combined with rigorous semantic review can drastically enhance scenario mining accuracy in complex driving environments.
We present our submission to the CVPR 2026 Argoverse 2 Scenario Mining Challenge. Our system uses a four-stage pipeline: (1) autonomous code generation via a Claude Code agent powered by GLM~5.1, (2) iterative training set screening with Timestamp Balanced Accuracy threshold 0.8 to curate few-shot examples, (3) semantic code review by a separate Claude Code session, and (4) Qwen3-VL scene-level verification to filter false positives. We report results on the Argoverse 2 test set.