The Chinese University of Hong Kong, Tencent Ethereal Audio Lab
Forget flat, lifeless speech: this model uses self-critique to generate expressive speech rivaling GPT-4o-Audio, even with significantly less training data.
Slash spoken dialogue system latency by up to 51% with a new architecture that lets the system "listen-while-thinking" and "speak-while-thinking."