Search papers, labs, and topics across Lattice.
2
0
3
2
Open-source TTS models can beat commercial systems in specific languages, but current instruction-following TTS still struggles with complex instructions like nuanced paralinguistic controls.
Achieve faster, more accurate turn-taking in spoken dialogue by fusing streaming speech recognition with raw audio cues – even when it's noisy.