21 papers from CMU Machine Learning on World Models & Planning
LLMs in embodied environments get a massive boost from structured rules, with rule retrieval alone contributing +14.9 percentage points to single-trial success.
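For intuition, here is a minimal sketch of what rule retrieval can look like in practice. The bag-of-words retriever, rule text, and prompt layout below are placeholders, not the paper's actual design:

```python
import numpy as np

# Toy rule retrieval for an embodied LLM agent: embed candidate rules and the
# current observation, then prepend the top-k most similar rules to the prompt.
RULES = [
    "If the door is locked, look for a key before retrying the handle.",
    "Heat sources must be turned off before leaving a room.",
    "Carry at most one object per gripper at a time.",
]

def embed(text: str, vocab: dict[str, int]) -> np.ndarray:
    """Toy bag-of-words embedding; stands in for a learned text encoder."""
    v = np.zeros(len(vocab))
    for tok in text.lower().split():
        if tok in vocab:
            v[vocab[tok]] += 1.0
    return v

query = "the door will not open and seems locked"
vocab = {tok: i for i, tok in enumerate(
    sorted({t for r in RULES + [query] for t in r.lower().split()}))}

rule_vecs = np.stack([embed(r, vocab) for r in RULES])
q = embed(query, vocab)

# Cosine similarity, then keep the top-2 rules to prepend to the agent's prompt.
sims = rule_vecs @ q / (np.linalg.norm(rule_vecs, axis=1) * np.linalg.norm(q) + 1e-8)
top_k = [RULES[i] for i in np.argsort(-sims)[:2]]
prompt = "Relevant rules:\n- " + "\n- ".join(top_k) + f"\n\nObservation: {query}\nAction:"
print(prompt)
```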
By fusing IMU and insole pressure data within a physics simulation, GRIP achieves more physically plausible human motion capture than IMU-only methods.
Accurately simulating the snap-fit mechanics of interlocking bricks, BrickSim unlocks a new level of realism for robotic manipulation research involving complex assemblies.
Forget expensive real-world data collection: a massive, diverse synthetic dataset enables surprisingly effective zero-shot transfer for robotic manipulation.
Strategic recovery from failures is key to deploying robots for complex assembly tasks in the real world.
Injecting muscle synergy priors into reinforcement learning drastically improves the realism of simulated human locomotion, even with limited real-world data.
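A minimal sketch of one way a synergy prior can enter RL: the policy acts in a low-dimensional synergy space and a fixed non-negative matrix expands its actions to muscle excitations. The sizes and random matrix below are illustrative, not the paper's:

```python
import numpy as np

# Hypothetical setup: 4 synergies controlling 16 muscles. In practice the
# synergy basis W would come from factorizing recorded EMG (e.g., via
# non-negative matrix factorization); here it is random for illustration.
rng = np.random.default_rng(0)
n_synergies, n_muscles = 4, 16
W = np.abs(rng.normal(size=(n_muscles, n_synergies)))   # fixed synergy basis

def muscle_excitations(policy_action: np.ndarray) -> np.ndarray:
    """Map a low-dimensional policy action (synergy space) to muscle space."""
    activations = 1.0 / (1.0 + np.exp(-policy_action))  # squash to [0, 1]
    return np.clip(W @ activations, 0.0, 1.0)

# The RL policy now explores a 4-D action space instead of a 16-D one.
print(muscle_excitations(rng.normal(size=n_synergies)).shape)   # (16,)
```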
Panoramic depth perception and differentiable physics unlock surprisingly robust collision avoidance, even generalizing to unseen simulation environments.
Unlock up to 59x cost reductions in optimization by pretraining ML surrogates with cheap, imperfect labels and then refining them with self-supervision.
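A toy sketch of the two-phase recipe, assuming a smoothness-style consistency loss as the self-supervised signal (the paper's actual refinement objective may differ):

```python
import torch
import torch.nn as nn

# Toy version of the two-phase recipe. The "expensive" objective is
# f(x) = sum(x^2); the cheap labels are a biased, noisy stand-in for it.
net = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

def cheap_label(x: torch.Tensor) -> torch.Tensor:
    """Imperfect, low-cost supervision: scaled truth plus noise."""
    return 0.9 * (x ** 2).sum(-1, keepdim=True) + 0.1 * torch.randn(x.size(0), 1)

# Phase 1: pretrain the surrogate on abundant cheap labels.
for _ in range(200):
    x = torch.randn(128, 8)
    loss = nn.functional.mse_loss(net(x), cheap_label(x))
    opt.zero_grad()
    loss.backward()
    opt.step()

# Phase 2: label-free refinement. Here: predictions should be stable under
# tiny input jitter, which regularizes the surrogate where labels were noisy.
for _ in range(100):
    x = torch.randn(128, 8)
    loss = nn.functional.mse_loss(net(x), net(x + 0.01 * torch.randn_like(x)))
    opt.zero_grad()
    loss.backward()
    opt.step()
```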
Unsupervised discovery of object keypoints and dynamics directly from video unlocks state-of-the-art world models for decision-making.
Skip the motion-capture grind: train your hip exoskeleton controller entirely in simulation and still see it work on real hardware.
Achieve real-time safe control of complex robots by representing their dynamics as a linear system in a higher-dimensional space, enabling fast quadratic programming for both tracking and obstacle avoidance.
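A minimal sketch of the control step this enables, using hand-picked lifting observables and a toy lifted-linear model rather than the learned ones a real system would use:

```python
import numpy as np
import cvxpy as cp

# Toy lifted-linear (Koopman-style) control step. The observables, dimensions,
# and matrices here are made up for illustration; in practice the lifting and
# the (A, B) model would be identified from data.
rng = np.random.default_rng(1)
nz, nu = 6, 2                       # lifted-state and input dimensions
A = 0.9 * np.eye(nz) + 0.01 * rng.normal(size=(nz, nz))
B = rng.normal(size=(nz, nu))

def lift(x: np.ndarray) -> np.ndarray:
    """Hand-picked observables mapping a 2-D state into the 6-D lifted space."""
    return np.concatenate([x, x**2, [np.sin(x[0]), np.cos(x[1])]])

z = lift(np.array([0.5, -0.2]))
z_ref = np.zeros(nz)                # track the lifted origin

# One QP solve per control step: quadratic tracking cost, linear dynamics.
u = cp.Variable(nu)
cost = cp.sum_squares(A @ z + B @ u - z_ref) + 0.1 * cp.sum_squares(u)
limits = [cp.norm(u, "inf") <= 1.0]  # actuator bounds; obstacle avoidance would
                                     # add further linear constraints on A z + B u
cp.Problem(cp.Minimize(cost), limits).solve()
print("control input:", u.value)
```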
Forget hand-engineering world models – this work proves that competent agents *must* internally represent the world in a structured, predictive way to minimize regret under uncertainty.
Legged robots can now tiptoe around your expensive gadgets, thanks to a new RL framework that combines semantic understanding with low-level control to avoid stepping on designated objects.
VLA models struggle with physical reasoning, but Pri4R's simple trick of predicting 3D point tracks during training boosts performance by up to 40% on manipulation tasks, without adding any inference overhead.
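To make the trick concrete, here is a hedged sketch of a shared backbone with an auxiliary track head used only at training time. The encoder, head sizes, and loss weight are assumptions, not Pri4R's actual architecture:

```python
import torch
import torch.nn as nn

# The shape of the trick: an auxiliary head regresses future 3-D point tracks
# during training and is simply skipped at inference, so deployment cost is
# unchanged.
class VLAWithTrackAux(nn.Module):
    def __init__(self, d=256, n_points=32, horizon=8, action_dim=7):
        super().__init__()
        self.backbone = nn.Sequential(nn.Linear(512, d), nn.ReLU())  # stand-in encoder
        self.action_head = nn.Linear(d, action_dim)
        self.track_head = nn.Linear(d, n_points * horizon * 3)       # (x, y, z) per step

    def forward(self, obs, with_tracks: bool = False):
        h = self.backbone(obs)
        if with_tracks:                         # training path
            return self.action_head(h), self.track_head(h)
        return self.action_head(h)              # inference path: no extra compute

model = VLAWithTrackAux()
obs = torch.randn(4, 512)                       # placeholder observation features
actions, tracks = model(obs, with_tracks=True)
# Placeholder targets stand in for demonstrated actions and ground-truth tracks.
loss = nn.functional.mse_loss(actions, torch.zeros_like(actions)) \
     + 0.5 * nn.functional.mse_loss(tracks, torch.zeros_like(tracks))
loss.backward()
```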
By pausing to "think" with latent diffusion, STAR-LDM achieves superior language understanding, narrative coherence, and controllable generation compared to standard autoregressive models of similar size.
Robots can now perform intricate assembly tasks and recover from errors in real time, without any training, by fusing vision-language models with video-based kinematic priors for action planning.
Robots can now adapt their safety behavior on the fly in response to changing real-world contexts, without needing pre-programmed rules or maps.
Stop guessing about automated guided vehicle (AGV) fleet management: LSMART offers a realistic, open-source simulator for benchmarking multi-agent path finding (MAPF) algorithms in complex, lifelong scenarios, revealing the critical design choices that make or break performance.
Robots can now learn long-horizon tasks far more effectively by distilling complex histories into a few key visual moments, outperforming standard imitation learning by 70% on real-world tasks.
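One simple way such distillation could work, sketched under an assumed feature extractor and a greedy change-detection rule (not necessarily the paper's exact method):

```python
import numpy as np

# Distill a long observation history into a few key frames: greedily keep
# frames whose (stand-in) features differ most from the last kept frame.
rng = np.random.default_rng(2)
history = rng.normal(size=(500, 128))           # 500 frames of 128-d features

def select_keyframes(feats: np.ndarray, k: int = 5, thresh: float = 12.0) -> list[int]:
    kept = [0]                                  # always keep the first frame
    for t in range(1, len(feats)):
        if np.linalg.norm(feats[t] - feats[kept[-1]]) > thresh:
            kept.append(t)
    # If the threshold kept too many, retain only the k most recent key moments.
    return kept[-k:]

idx = select_keyframes(history)
policy_input = history[idx]                     # compact context for the policy
print(idx, policy_input.shape)
```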
RynnBrain leapfrogs existing embodied foundation models, offering a unified, open-source spatiotemporal model that excels at physically grounded reasoning and planning across a wide range of benchmarks.
Forget rigid game environments – PAN lets you simulate open-world scenarios with language-specified actions and long-term visual coherence, opening the door to more realistic AI training.