Search papers, labs, and topics across Lattice.
4
36
5
7
A real-time generative world model can synthesize complex driving scenarios that traditional simulators struggle to capture, enabling safer and more effective evaluation of autonomous vehicle policies.
Cosmos 3 sets a new benchmark for omnimodal models, outperforming existing state-of-the-art in both Text-to-Image and Image-to-Video tasks.
By decoupling MLLM instruction tuning from DiT alignment, DuoGen achieves state-of-the-art interleaved multimodal generation without costly unimodal pretraining.
Forget synthetic data that looks like it came from a PS2 game: NVIDIA's new Cosmos-Predict2.5 generates high-fidelity videos for training embodied AI, opening the door to more realistic and reliable simulations.