WeChat Vision, Tencent Inc
Audio-Omni can edit sound, music, and speech with a single model, rivaling specialized systems and unlocking capabilities like knowledge-augmented reasoning and zero-shot cross-lingual control.
Achieve high-fidelity, temporally coherent video editing without paired training data by combining sparse semantic control with dense motion and texture synthesis.