Search papers, labs, and topics across Lattice.
1
0
3
MLLMs can now better understand your pointing with Hand Intent Tokens (HINT), boosting accuracy on egocentric video question answering by 6.6% on a new benchmark.