Search papers, labs, and topics across Lattice.
Nanjing University
1
0
3
2
Instead of training separate video diffusion models for each multimodal task, UniVidX learns a single model that handles diverse pixel-aligned video generation problems.