Search papers, labs, and topics across Lattice.
University of Science and Technology of China
5
0
5
Achieving high-quality speech reconstruction at just 0.5 kbps could revolutionize low-bandwidth communication systems.
VoCodec achieves a remarkable 27% bitrate reduction while enhancing speech quality by intelligently allocating resources based on voicing characteristics.
Minor acoustic noise can nearly double the rate of unsafe outputs in clinical documentation, despite only a slight increase in Word Error Rate.
ASR-driven data augmentation boosts Alzheimer's detection accuracy by over 4%, showcasing the potential of synthetic speech in clinical diagnostics.
Seamless transitions between speech and singing modes are now driven purely by text context, achieving state-of-the-art results in code-switching synthesis.