ERNIE Team
Keyframe-residual captioning unlocks high-fidelity video-language supervision, surpassing direct VLM captioning in capturing fine-grained visual details.