Open-Sora Plan Team, Peking University
OSP-Next achieves near state-of-the-art video generation quality while delivering speedups of up to 2.27x on Ascend 950PR NPUs through a novel combination of sparse attention, quantization, and parallelism techniques.
Forget specialized architectures: StepAudio 2.5 shows that a single audio-language foundation model, shaped by RLHF, can dominate ASR, TTS, and real-time dialogue simultaneously.
Video Transformers can achieve near-full attention accuracy with significantly less compute by focusing only on informative vertical vectors.