Search papers, labs, and topics across Lattice.
VCIP, CS, Nankai University
2
0
5
Achieve up to 2.5x faster video object removal with comparable visual quality by intelligently selecting only the essential tokens for processing in Diffusion Transformers.
Skip the bulky bidirectional teacher: this new method trains a fast, causal audio-video generator directly, slashing sampling steps while maintaining top-tier quality.