Search papers, labs, and topics across Lattice.
2
0
3
0
Injecting 3D scene understanding into VLAs without extra labels yields surprisingly large gains in robotic manipulation, suggesting a critical missing piece in current vision-language-action models.
Achieve state-of-the-art 3D object detection by fusing radar and camera data without relying on accurate ego-pose estimation, a common bottleneck in autonomous driving.