Zhejiang University
CineDance-1M targets open-source cinematic audio-video generation, offering over 1 million high-quality, structured video samples.
MLLMs can improve video understanding by integrating watching, remembering, and reasoning into a single framework that handles long-range dependencies and sparse evidence.
ST-DRC achieves strong identity preservation in text-to-video generation without compromising semantic fidelity.