Search papers, labs, and topics across Lattice.
Corresponding author are Bo Cheng and Soujanya Poria
1
0
3
4
By explicitly grounding reasoning steps to visual objects, Chain-of-Glimpse enables more accurate and interpretable video understanding, outperforming object-agnostic methods on multiple benchmarks.