Search papers, labs, and topics across Lattice.
2
0
4
31
Video-LLMs can achieve more reliable reasoning by first constructing a compact, structured representation of salient events and their causal relationships.
Forget tedious pose annotations: this text-to-video approach generates realistic acrobatic human motions by cascading a text-to-skeleton model with a pose-conditioned diffusion model.