Search papers, labs, and topics across Lattice.
Centre for Artificial Intelligence and Robotics, The State Key Laboratory of Internet, of Things for Smart City, University of Macau
1
0
3
VLMs can achieve state-of-the-art Vision-Language Navigation performance by explicitly training them to reason about past actions and predict future visual transitions.