Shanghai AI Laboratory
The paper proposes HarmoniDPO, a novel framework that integrates preference-based optimization into diffusion-based video-to-audio (V2A) generation. It outperforms state-of-the-art methods in audio-video synchronization and subjective audio quality, offering a robust approach to generating realistic, human-preferred audio from video.