Search papers, labs, and topics across Lattice.
University of Science and Technology of China
4
0
4
Achieving high-quality speech reconstruction at just 0.5 kbps could revolutionize low-bandwidth communication systems.
VoCodec achieves a remarkable 27% bitrate reduction while enhancing speech quality by intelligently allocating resources based on voicing characteristics.
Minor acoustic noise can nearly double the rate of unsafe outputs in clinical documentation, despite only a slight increase in Word Error Rate.
Seamless transitions between speech and singing modes are now driven purely by text context, achieving state-of-the-art results in code-switching synthesis.