Search papers, labs, and topics across Lattice.
Beihang University;
1
0
3
6
Despite advances in multimodal models, they still struggle to understand spatial relationships from an egocentric perspective, as shown by a 37.66% performance gap on the new SAW-Bench benchmark.