Humanoid robots can now traverse complex terrains with human-like gaits, thanks to a surprisingly simple and efficient framework that eschews adversarial training.
Robots can now manipulate objects with greater dexterity and adaptability thanks to a new world model that leverages both vision and high-frequency tactile feedback to predict and react to contact dynamics.
Ditch fixed compute budgets: this new flow-matching method for robotic control adaptively allocates computation, speeding up simple tasks and focusing on complex ones.
Forget expensive real-world data collection: a massive, diverse synthetic dataset enables surprisingly effective zero-shot transfer for robotic manipulation.
World Action Models can ditch the slow, iterative "imagine-then-execute" loop at test time without sacrificing performance, achieving a 4x speedup.
Humanoid robots can now handle heavy, unknown payloads in the real world thanks to a system that identifies mass distribution via differentiable simulation.
Achieve a 40% jump in success rates on real-world contact-rich manipulation by intelligently scheduling force feedback into visual-motor policies.
Forget predefined areas of interest: this multi-agent exploration framework uses Gaussian belief mapping to adaptively balance scientific discovery and safety in hazardous off-world environments.
Human-robot teams can slash interaction costs by 50% and task times by 25% when robots actively resolve uncertainty about tasks and infer human intent using LLMs and spatial reasoning.
LLMs can orchestrate when and how humans provide input to UAVs, dramatically improving mission success rates while keeping interaction overhead to a minimum.
By combining video generation and vision-language models, EmboAlign achieves a 43% boost in real-world robot manipulation success without any task-specific training.
Training generalist robots just got a whole lot easier: RoboCasa365 offers a massive, diverse, and reproducible benchmark for household mobile manipulation.
By pausing to "think" with latent diffusion, STAR-LDM achieves superior language understanding, narrative coherence, and controllable generation compared to standard autoregressive models of similar size.
Forget painstakingly engineering robot behaviors: DreamZero learns directly from video of other robots or even humans, adapting to new tasks and bodies with just minutes of data.
Training a robot foundation model on 30,000 hours of heterogeneous embodied data lets it outperform prior methods by up to 48% on complex manipulation tasks and even benefit from low-quality data.
Forget synthetic data that looks like it came from a PS2 game: NVIDIA's new Cosmos-Predict2.5 generates high-fidelity videos for training embodied AI, opening the door to more realistic and reliable simulations.
Ditch slow iterative refinement: conditional flow-matching models can directly learn meaningful proposal distributions from noisy sampling-based MPC data, slashing planning time.
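To make the flow-matching idea concrete, here is a minimal, generic sketch of the conditional flow-matching regression target (straight-line interpolation paths), not the paper's implementation: the helper `flow_matching_targets` and the toy data are illustrative assumptions. A learned velocity field would be trained to regress `v_target` given `x_t`, `t`, and the conditioning signal.

```python
import numpy as np

rng = np.random.default_rng(0)

def flow_matching_targets(x0, x1, t):
    """Interpolant x_t and velocity regression target along straight paths.

    x0: base (noise) samples, shape (N, D)
    x1: target samples, e.g. good MPC rollouts, shape (N, D)
    t:  interpolation times in [0, 1], shape (N,)
    """
    t = t.reshape(-1, 1)
    x_t = (1.0 - t) * x0 + t * x1   # point on the probability path
    v_target = x1 - x0              # constant velocity of the straight path
    return x_t, v_target

# Toy stand-in for "noisy MPC samples": targets are a shifted copy of the noise
x0 = rng.standard_normal((4, 2))
x1 = x0 + 3.0
t = rng.uniform(size=4)
x_t, v = flow_matching_targets(x0, x1, t)
```

A model `v_theta(x_t, t, c)` trained with an L2 loss against `v_target` can then generate proposals in a single (or few) integration steps, which is where the speedup over iterative refinement comes from.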
Imagine training robots to manipulate objects in the real world, but entirely within a high-fidelity, diffusion-based dream.