Search papers, labs, and topics across Lattice.
Shanghai Innovation Institute, AGIBOT
2
0
5
$\tau_0$-WM outperforms traditional models by seamlessly integrating action prediction and evaluation, leading to superior performance in complex robotic tasks.
MLLMs can "hear" a little, but EgoSound reveals they're still largely deaf to the nuances of sound in egocentric video, especially when it comes to spatial and causal reasoning.