Search papers, labs, and topics across Lattice.
1
0
3
18
Finally, a single model handles multi-modal video generation, inpainting, and editing at cinematic resolutions with synchronized audio, all while accepting diverse inputs like text, images, video clips, and audio references.