Francesco Ferroni

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Robotics & Embodied AI (3)World Models & Planning (3)Multimodal Models (3)Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Sanja Fidler (3)Xuanchi Ren (3)Yogesh Balaji (3)Xiaohui Zeng (3)

Papers (4)

Jun 2, 2026

NVIDIA2w ago·also ShanghaiTech, UofT

NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation

A real-time generative world model can synthesize complex driving scenarios that traditional simulators struggle to capture, enabling safer and more effective evaluation of autonomous vehicle policies.

Nvidia Aarti Basant, Amlan Kar, Despoina Paschalidou +29

Robotics & Embodied AI World Models & Planning

Jun 1, 2026

NVIDIA2w ago·also Georgia Tech, HKUST, IIS Academia Sinica, JHU +6

Cosmos 3: Omnimodal World Models for Physical AI

Cosmos 3 sets a new benchmark for omnimodal models, outperforming existing state-of-the-art in both Text-to-Image and Image-to-Video tasks.

Aditi, Niket Agarwal, Arslan Ali +287

Multimodal Models Robotics & Embodied AI World Models & Planning

Jan 31, 2026

NVIDIAJan 31, 2026·also Georgia Tech, University of Southern

DuoGen: Towards General Purpose Interleaved Multimodal Generation

By decoupling MLLM instruction tuning from DiT alignment, DuoGen achieves state-of-the-art interleaved multimodal generation without costly unimodal pretraining.

Min Shi, Xiaohui Zeng, Jiannan Huang +13

Architecture Design (Transformers, SSMs, MoE)Data Curation & Synthetic Data Multimodal Models

Oct 28, 2025

NVIDIAOct 28, 2025·also BUPT, Cohere, Georgia Tech, KAIST +5

World Simulation with Video Foundation Models for Physical AI

Forget synthetic data that looks like it came from a PS2 game: NVIDIA's new Cosmos-Predict2.5 generates high-fidelity videos for training embodied AI, opening the door to more realistic and reliable simulations.

Nvidia Arslan Ali, Junjie Bai, Maciej Bala +8536