The Hong Kong Polytechnic University
Speaker recognition accuracy improves dramatically when a U-Net-based fusion of noisy and enhanced speech is combined with a novel training strategy.
Training on real speech prosody alone can cut speech deepfake detection error rates by over 70% on emotional attacks, a blind spot for current detectors.