KAIST, South Korea
Achieve more natural and synchronized video dubbing by conditioning a discrete flow matching TTS model on facial expressions and cross-modal alignment.