MeanVC 2 halves voice conversion latency and improves robustness to low-quality reference audio, making it better suited to real-time voice applications.
FlashTTS reduces first-packet latency to 325 ms for real-time speech dialogue systems without sacrificing voice quality.
SoulX-Transcriber achieves high accuracy in multi-speaker transcription, outperforming existing models by handling speaker overlap and rapid turn-taking.