Tongji University, School of Computer Science and Technology, Shanghai, China
MLLMs can get a whopping 8% boost in spatial reasoning accuracy, with 50% less memory, simply by fusing geometric and semantic features in the vision encoder.