Search papers, labs, and topics across Lattice.
Hainan University
1
0
3
Treating geometry as a fundamental representational prerequisite, rather than a late-fusion auxiliary signal, significantly boosts spatio-temporal reasoning in vision-language models.