Search papers, labs, and topics across Lattice.
Department of Statistics, Korea University *Equal contribution
1
0
3
Achieving fine-grained semantic alignment in text-to-video generation is now possible with a model that explicitly verifies every prompt condition against visual evidence.