Search papers, labs, and topics across Lattice.
University of Oxford
3
0
5
Training on Syn4D could unlock breakthroughs in dynamic scene understanding, where current datasets fall short in providing dense, complete, and accurate geometric annotations.
Multi-event video generation gets a 33% quality boost with TS-Attn, a training-free attention mechanism that dynamically aligns video content with complex temporal prompts.
A reward model trained on spatial relationship preferences beats proprietary models at spatial understanding in text-to-image generation, and unlocks better RL-based image generation.