S-Lab, NTU
Prisma-World achieves unprecedented cross-view consistency in multi-agent video generation by leveraging a joint geometry-aware denoising process.
Achieve unprecedented spatial control in single-image generation by injecting 3D positional encodings into a diffusion model, enabling precise manipulation of object placement and scale.
Forget painstakingly curating 3D-consistent datasets: this RL approach uses a 3D foundation model to *verify* consistency, turning 3D scene editing into a tractable reinforcement learning problem.