Search papers, labs, and topics across Lattice.
3
0
4
5
Training on real speech prosody alone can cut speech deepfake error rates by over 70% on emotional attacks, a blindspot for current detectors.
LLMs can spot fake words in speech by recognizing common editing patterns, but this reliance on learned biases hinders generalization to new manipulation techniques.
Achieve zero-shot voice conversion competitive with methods requiring more data or training, using a simple, invertible linear method to disentangle speech content from speaker timbre.