Qinghai Normal University
Stop guessing affordances from static scenes: A3R's agentic approach leverages cross-dimensional evidence acquisition to significantly outperform one-shot methods in complex 3D environments.
VLMs struggle to connect the dots between dynamic drone footage and satellite imagery, highlighting a critical gap in their spatial reasoning abilities.