Achieve state-of-the-art video reasoning by using visual prompting during training to guide reinforcement learning, then distilling this ability into a model that performs grounded reasoning on raw videos, without visual prompts, at inference time.