Search papers, labs, and topics across Lattice.
1
0
3
2
Forget local semantic alignment: CAST unlocks temporally coherent video retrieval and generation by explicitly modeling visual state transitions.