Search papers, labs, and topics across Lattice.
Nanyang Technological University
1
0
3
38
Text-to-video models can now generate videos that actually respect spatial relationships, thanks to a new geometry-based reward function that beats VLM-based alternatives.