Tongji University
Uniform frame sampling in video LLMs leaves crucial temporal information on the cutting room floor. GroundVTS instead selectively attends to the most informative segments, substantially boosting grounding performance.