Search papers, labs, and topics across Lattice.
2
4
3
46
LLMs in autonomous driving get a reality check: AgentDrive, a new benchmark, exposes gaps in reasoning abilities, particularly in physics and structured scenarios, even as open models rapidly improve.
LLMs still struggle with ethical and resource-constrained decisions in UAV flight scenarios, despite strong performance in perception and policy reasoning, as revealed by a new 50,000-scenario benchmark.