Search papers, labs, and topics across Lattice.
2
0
4
EQA agents can now handle dynamic, human-populated scenes better thanks to a training-free method that selectively remembers only the most informative visual evidence.
Achieve state-of-the-art performance in vision-language-action tasks with Xiaomi-Robotics-0, a model that executes smoothly in real-time on real robots using a consumer-grade GPU.