Search papers, labs, and topics across Lattice.
New York University 2 UC Berkeley 3 Meta FAIR
1
0
3
Camera pose, largely ignored in video LLMs, unlocks significant gains in spatial reasoning and even improves general video QA when used as a lightweight supervisory signal.