Search papers, labs, and topics across Lattice.
1
0
3
Instead of training separate video diffusion models for each multimodal task, UniVidX learns a single model that handles diverse pixel-aligned video generation problems.