Search papers, labs, and topics across Lattice.
Meituan
1
0
3
2
Finally, a single model handles multi-modal video generation, inpainting, and editing at cinematic resolutions with synchronized audio, all while accepting diverse inputs like text, images, video clips, and audio references.