Search papers, labs, and topics across Lattice.
100 papers published across 8 labs.
Unlock zero-shot sim-to-real transfer for complex legged robots by offloading gait selection to a learned policy that guides a lower-level MPC.
Agentic search gets a meta-RL boost: MR-Search learns to self-reflect and adapt search strategies across episodes, significantly outperforming standard RL baselines.
Multi-robot coverage can now handle multiple sensory demands simultaneously, with provable guarantees on performance even when those demands are initially unknown.
Achieve real-time safety-critical robot control in partially observable environments by decoupling goal reaching, information gathering, and safety into modular, certificate-based components operating directly in belief space.
Stop wrestling with unstable action spaces: ResWM reframes visual RL by predicting incremental action adjustments, leading to smoother control and better performance.
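The incremental-action idea in this entry can be sketched as a thin wrapper: the policy predicts a bounded adjustment to the previous command rather than an absolute action. All names and limits below are illustrative, not ResWM's actual interface.

```python
import numpy as np

class ResidualActionWrapper:
    """Illustrative sketch of an incremental (residual) action space:
    the policy outputs a bounded delta that is accumulated into the
    previous absolute action. Names and limits are placeholders."""

    def __init__(self, act_dim, delta_limit=0.1, act_low=-1.0, act_high=1.0):
        self.delta_limit = delta_limit
        self.act_low, self.act_high = act_low, act_high
        self.prev_action = np.zeros(act_dim)

    def step_action(self, delta):
        # Clip the increment, then accumulate and clip to the action bounds.
        delta = np.clip(delta, -self.delta_limit, self.delta_limit)
        self.prev_action = np.clip(self.prev_action + delta,
                                   self.act_low, self.act_high)
        return self.prev_action
```

Because each command can move at most `delta_limit` per step, consecutive actions are automatically close to each other, which is the source of the smoother control the entry describes.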
Forget hand-tuning rollout budgets: $V_{0.5}$ dynamically allocates compute to sparse RL rollouts based on a real-time statistical test of a generalist value model's prior, slashing variance and boosting performance.
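The compute-allocation idea can be caricatured with a plain z-test: keep spending rollouts only while the observed return mean is statistically indistinguishable from the value model's prior. The fixed noise scale and threshold below are placeholders, not the $V_{0.5}$ procedure.

```python
import math

def needs_more_rollouts(prior_mean, returns, sigma=1.0, z_thresh=1.96):
    """Toy sketch: z-test of the observed rollout mean against a
    generalist value model's prior; allocate more rollouts only while
    the difference is statistically inconclusive. The fixed-sigma
    assumption and names are illustrative."""
    n = len(returns)
    if n == 0:
        return True  # no evidence yet: always spend at least one rollout
    mean = sum(returns) / n
    z = abs(mean - prior_mean) / (sigma / math.sqrt(n))
    return z < z_thresh  # inconclusive -> spend more compute
```

Once the rollout evidence clearly agrees or disagrees with the prior, the test fires and the budget is released to other states, which is the variance-reduction mechanism the entry alludes to.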
Finally, a multi-robot path planning benchmark that lets you directly compare grid-based, roadmap, and continuous planners on the same tasks.
Steer your robot's diffusion policy away from failure modes at inference time with a lightweight performance predictor trained via self-supervised attention.
Forget hand-crafted rewards: this new method learns dexterous manipulation by encouraging the robot hand to explore diverse contact patterns on objects, leading to impressive real-world transfer.
Robust co-design optimization can significantly improve the performance of agile UAVs in real-world environments by directly incorporating uncertainty and disturbances into the design process.
By forecasting compact world dynamics before taking action, DynVLA leapfrogs traditional CoT methods to achieve more informed and physically grounded autonomous driving decisions.
Robots can now learn to manipulate novel objects in dynamic environments by using LLMs to bridge the gap between symbolic planning and reinforcement learning.
Unlock superior trajectories in complex environments with a new ADMM-based solver that jointly optimizes spatial and temporal domains, eliminating the need for complex warm starting.
Incomplete trajectory data got you down? This plug-and-play framework progressively aligns features from incomplete observations with complete ones, boosting prediction accuracy in autonomous driving scenarios.
Achieve 2x better coverage of autonomous driving safety requirements with 6x fewer simulations by automatically generating test scenarios from formal LTLf specifications.
Injecting muscle synergy priors into reinforcement learning drastically improves the realism of simulated human locomotion, even with limited real-world data.
Reaction wheels can dramatically stabilize bipedal hopping robots in low-gravity environments, enabling more consistent upright landings on irregular extraterrestrial terrains.
Achieve significantly higher accuracy and lower mental demand in bimanual teleoperation by intelligently coupling intention estimation with scene-graph task planning and context-aware motion assistance.
Robots can now loosen screws with human-level dexterity thanks to a new framework that combines haptic estimation, online planning, and adaptive stiffness control using a parameterized Equilibrium Manifold.
AI can bridge the gap between simulation and reality in erosion modeling, boosting prediction accuracy by fusing CFD-DEM simulations with experimental data.
By fusing language model reasoning with diffusion-based trajectory generation, KnowDiffuser leapfrogs existing autonomous driving planners on the nuPlan benchmark.
A single meta-RL policy can now handle 66% mass variations and 70% rotor thrust losses in quadrotors, achieving zero-shot sim-to-real transfer for agile maneuvers.
Gaussian trajectory predictors often lie about their confidence, but a new loss function leveraging Kernel Density Estimation can make them more honest, leading to safer autonomous navigation.
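A minimal version of the calibration idea: score the ground-truth endpoint under a Gaussian KDE fitted to the predictor's samples, so over- or under-confident sample spreads are penalized. Function name and bandwidth are illustrative, not the paper's loss.

```python
import numpy as np

def kde_nll(samples, gt, bandwidth=0.5):
    """Negative log-likelihood of the ground-truth point under an
    isotropic Gaussian KDE over the predictor's samples (toy sketch).
    samples: (N, d) predicted endpoints, gt: (d,) ground truth."""
    d = samples.shape[1]
    diffs = samples - gt                                  # (N, d)
    sq = np.sum(diffs ** 2, axis=1) / (2 * bandwidth ** 2)
    log_kernels = -sq - d * np.log(bandwidth * np.sqrt(2 * np.pi))
    # log-mean-exp over kernels for numerical stability
    m = log_kernels.max()
    log_density = m + np.log(np.mean(np.exp(log_kernels - m)))
    return -log_density
```

Predictions whose sample cloud actually covers the ground truth get low loss; confidently wrong clouds get heavily penalized, pushing the predictor toward honest uncertainty.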
By decoupling visual and motor information during pretraining, FutureVLA unlocks more effective visuomotor prediction for vision-language-action models, boosting performance without modifying downstream architectures.
Guaranteeing safety in diffusion-based trajectory planning is now possible by embedding a certifiable barrier function directly into the denoising loop, ensuring forward invariance and preserving the learned path geometry.
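As a toy instance of a barrier enforced inside the denoising loop, one can project every intermediate waypoint out of a circular obstacle after each reverse-diffusion step, so h(x) = ‖x − c‖ − r stays nonnegative. The obstacle geometry and the trivial denoising callback in the test are illustrative, not the paper's certified construction.

```python
import numpy as np

def project_safe(traj, center, radius):
    """Push any waypoint inside the circle h(x) = ||x - c|| - r < 0
    radially back onto the boundary (toy barrier projection)."""
    out = traj.copy()
    for i, p in enumerate(out):
        v = p - center
        dist = np.linalg.norm(v)
        if dist < radius:
            direction = v / dist if dist > 1e-9 else np.eye(len(p))[0]
            out[i] = center + direction * radius
    return out

def denoise_with_barrier(traj, denoise_step, n_steps, center, radius):
    """Interleave one projection after every denoising step, so every
    intermediate (and the final) trajectory respects the obstacle."""
    for t in range(n_steps):
        traj = denoise_step(traj, t)           # one reverse-diffusion step
        traj = project_safe(traj, center, radius)
    return traj
```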
By jointly modeling video dynamics and actions, DiT4DiT achieves 10x sample efficiency and 7x faster convergence in robot policy learning, showing that video generation can be a powerful scaling proxy.
Achieve efficient task execution in shared workspaces by interleaving scheduling and motion planning, using symbolic feedback to guide the scheduler towards motion-feasible solutions.
Generate realistic and controllable videos of humans interacting with objects using only sparse motion cues, like wrist positions and object bounding boxes.
Drones can now proactively navigate turbulent environments thanks to a fast wind-prediction framework that integrates geometric perception and local weather data.
Ditch the map: a diffusion model learns to plan UAV swarm trajectories directly from RGB images, enabling reactive and adaptive navigation in cluttered environments.
Forget hand-crafted heuristics: this new dynamics-aware policy learns to exploit contact forces in cluttered environments, outperforming traditional methods by 25% in simulation and showing impressive sim-to-real transfer.
Physics-based dynamics models can make or break sim-to-real reinforcement learning, boosting real-world success by 50% in industrial control tasks where simplified models fail.
For the first time, a famine early warning system offers probabilistic, open-access, continuously running, machine-readable predictions with a commitment to public prospective verification.
Forget hand-engineered reward functions: this method uses language models to learn factorized world states that generalize to new goals and environments, outperforming LLM-as-a-Judge in zero-shot reward prediction.
Offline RL can be made more robust to distribution shift by directly optimizing against worst-case transition dynamics within an uncertainty set, leading to policies that avoid unreliable out-of-distribution actions.
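A toy, tabular instance of the worst-case idea: at each Bellman backup, perturb the nominal transition probabilities inside a small budget so as to minimize the expected next-state value (an s,a-rectangular uncertainty set). This is a simplified sketch, not the paper's algorithm.

```python
import numpy as np

def robust_value_iteration(P_nominal, R, eps, gamma=0.95, iters=200):
    """Robust value iteration: each backup moves up to `eps` probability
    mass from the best next state to the worst one, a crude adversary
    inside an s,a-rectangular uncertainty set (toy illustration).
    P_nominal: (S, A, S) transition tensor, R: (S, A) rewards."""
    S, A, _ = P_nominal.shape
    V = np.zeros(S)
    for _ in range(iters):
        worst = np.empty((S, A))
        for s in range(S):
            for a in range(A):
                p = P_nominal[s, a].copy()
                hi, lo = np.argmax(V), np.argmin(V)
                move = min(eps, p[hi])   # adversarial mass shift
                p[hi] -= move
                p[lo] += move
                worst[s, a] = p @ V
        V = np.max(R + gamma * worst, axis=1)
    return V
```

With `eps = 0` this reduces to standard value iteration; any positive budget yields a uniformly more pessimistic value function, which is what steers the policy away from out-of-distribution actions the model cannot vouch for.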
Stop letting simulator errors in critical regions derail your policies: Sim2Act aligns surrogate fidelity with downstream decision impact, leading to more stable and robust decision-making.
By communicating in a shared latent space, Latent-DARM lets you combine the global planning of diffusion models with the fluency of autoregressive models, boosting reasoning accuracy by up to 14% while slashing token usage.
MLLMs still struggle to reliably predict the long-term consequences of actions in egocentric videos, even with structured scene annotations.
LLMs can evolve surprisingly effective, interpretable Python planners that rival state-of-the-art classical planners, at a fraction of the computational cost.
Reconstructing and simulating wind-driven dynamics from video is now possible with a new differentiable framework that enforces fluid dynamics laws.
Robots can now recover from failures during manipulation tasks by explicitly tracking progress against spatial subgoals, without needing extra training data or models.
Self-wrapping cables aren't just a nuisance in robotic manipulation; they're a feature that can be exploited for redirected torque and more efficient object control.
Simulation-based inference can improve neutrino interaction model tuning beyond traditional methods, even suggesting parameter values that better fit experimental data.
By translating visual observations into language, LAP achieves state-of-the-art procedure planning by disambiguating visually similar actions, outperforming vision-only methods.
Humanoid locomotion can be retargeted more realistically by optimizing for dynamics and contact forces, leading to better imitation learning performance.
Latent world models for automated driving are ripe for standardization, and this paper offers a taxonomy and evaluation framework to make them decision-ready.
Text-only foundation models can perform surprisingly well on complex 3D spatial reasoning tasks, rivaling multimodal models, when equipped with a structured spatial representation derived from 3D reconstruction.
RoadLogic automates the creation of diverse, realistic autonomous vehicle test scenarios from declarative specifications, sidestepping the manual effort of imperative approaches.
Autonomous vehicles can now better adapt to the messy, ever-changing real world thanks to a new motion forecasting method that learns new object classes on the fly without forgetting old ones.
Forget costly physical experiments: this framework lets you simulate embodied human-robot interaction to optimize robot designs and controls, unlocking access to internal biomechanical metrics.
Forget pick-and-place: RuleSafe, a new benchmark featuring LLM-generated safe-cracking tasks, exposes the long-horizon planning weaknesses of current robot learning methods.
Ditching VAE bottlenecks for dense DINOv2 features unlocks more stable and accurate visual navigation world models.
Skip the costly robot teleoperation data: ZeroWBC learns surprisingly natural humanoid control policies directly from human egocentric videos.
LLMs can get a 27.8% boost in mathematical reasoning by fusing a hardware-efficient optimal control layer directly into their architecture, enabling planning before prediction.
Quadruped robots can now learn to navigate complex, real-world environments in minutes, not hours, thanks to a new RL framework that prioritizes safety and efficient exploration.
AutoAgent dynamically evolves agent cognition and memory to achieve superior performance in complex, dynamic environments, without requiring external retraining.
Trajectory prediction models can now adapt to new environments far more effectively thanks to a meta-learning approach that dynamically adjusts learning rates based on online data characteristics.
Fuzzy logic can smooth out the sometimes jerky paths generated by A* search, leading to safer and more efficient navigation for unmanned surface vehicles.
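A minimal stand-in for the smoothing step: reweight each A* waypoint by triangular membership weights over its neighbors, which rounds off sharp turns while pinning the endpoints. The window size and weighting scheme are illustrative, not the paper's fuzzy rule base.

```python
import numpy as np

def fuzzy_smooth(path, window=2):
    """Smooth an A*-style waypoint path with triangular
    (fuzzy-membership-like) weights over a sliding window;
    start and goal waypoints are kept fixed."""
    path = np.asarray(path, dtype=float)
    n = len(path)
    out = path.copy()
    for i in range(1, n - 1):
        lo, hi = max(0, i - window), min(n - 1, i + window)
        idx = np.arange(lo, hi + 1)
        w = window + 1 - np.abs(idx - i)  # triangular membership weights
        out[i] = (w[:, None] * path[idx]).sum(axis=0) / w.sum()
    return out
```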
By explicitly incorporating stochasticity into physics-informed traffic models, this work provides a more realistic and informative representation of traffic dynamics than traditional deterministic approaches.
Turn your robot's clumsy pre-trained behaviors into expert-level skills with DICE-RL, a surprisingly stable and efficient RL fine-tuning method.
Achieve formally certified collision risk guarantees for robot manipulators in complex, uncertain environments with a novel risk-bounded motion planning framework.
Autonomous driving gets a boost: EvoDriveVLA's collaborative perception-planning distillation framework significantly enhances VLA model performance by tackling perception degradation and planning instability.
A hierarchical OODA loop architecture can significantly improve the adaptability and efficiency of UAV swarms operating in dynamic, uncertain environments.
Autonomous racecars can now overtake rivals 51% faster and with 81% success by predicting their moves and planning dynamically feasible trajectories.
Forget external rewards—this agent learns to explore and adapt by prioritizing its own ignorance, surprise, and staleness, outperforming fixed strategies.
Stop struggling with compounding errors in long-horizon robotic tasks: AtomVLA leverages LLMs and latent world models to decompose tasks and score actions, boosting success rates to 97% on LIBERO.
By explicitly modeling and sharing "execution fidelity" – an estimate of local navigability – VORL-EXPLORE enables multi-robot exploration that avoids bottlenecks and oscillations common in dense, dynamic environments.
Forget expensive, inflexible physical simulators: MRDrive offers an open-source mixed reality platform for in-vehicle HCI research, blending real-world interaction with virtual environments.
Turn your Inspire RH56DFX hand from a black box into a research tool with this characterization, simulation, and control pipeline that achieves 87% grasp success on diverse objects.
LLMs can be used to prune irrelevant information *before* planning, enabling efficient long-horizon multi-robot task planning that outperforms both pure LLM and hybrid LLM-PDDL approaches.
Humanoid robots can now recover from falls with 93% success by baking classical balance principles into RL, enabling diverse strategies from ankle adjustments to compliant falling.
By disentangling rigid-body mechanics from stochastic interaction effects, STRIDE achieves more accurate and reliable dynamics prediction for robots operating in uncertain environments.
Humanoid robots can now perform complex loco-manipulation tasks with more natural and stable movements by decomposing control into VLM-orchestrated expert policies trained with human motion priors.
GP-PSRL can achieve sublinear regret bounds in continuous control even with unbounded state spaces, resolving prior theoretical limitations and opening the door to more complex RL settings.
By closing the loop with explicit planning and feedback, SPIRAL overcomes the temporal drift and weak semantic grounding plaguing one-shot video generation models.
By intelligently switching between exact and approximate evaluations during genetic programming, HE-GP slashes training time by 17.77% while simultaneously improving the quality of scheduling policies for Earth observation satellites.
The approximation error of spectral RL representations is fundamentally limited by the algebraic connectivity of the state-graph, revealing a crucial topological bottleneck.
Forget explicit labels: this method learns object co-occurrence priors directly from unlabeled visual data, rivaling human search efficiency.
Humanoid robots can now maintain balance under complex external forces without force/torque sensors, thanks to a force-adaptive RL policy that learns to anticipate and compensate for disturbances.
Legged robots can now safely explore unknown, deformable terrain using only proprioceptive feedback to estimate traversability, outperforming traditional methods.
Robots can now learn better world models through unsupervised self-play, outperforming models trained on human data by 40% in failure prediction and 65% in real-world RL.
Ray-tracing simulators can overestimate 5G throughput even with accurate channel predictions, because they fail to capture the real-world adaptation of MIMO spatial layers.
Decoupling reasoning from action generation in autonomous driving VLMs lets you beat larger end-to-end models while slashing training costs.
LLMs can provide quality assurance for reinforcement learning-based search plans in high-stakes missing-child investigations, improving the reliability of AI-driven decision support.
RL agents can completely miss gradual observation drift until it's too late, with a sharp "boiling frog" threshold determining when they finally wake up to the problem.
By verifying high-level symbolic plans with learned continuous dynamics, this neuro-symbolic planner achieves the speed of symbolic methods with the reliability of continuous planning.
Achieve substantially higher success rates in long-horizon mobile manipulation by grounding a vision-language model within a skill-state graph, enabling logically consistent planning and closed-loop replanning.
Forget painstakingly collecting robot data in the real world – this interactive world simulator lets you train policies that perform just as well, but entirely in simulation.
Ditch the comms: this multi-UAV coordination method uses only onboard LiDAR and a perception-aware navigation framework to achieve safe and scalable operation in GNSS-denied environments like dense forests.
By enforcing physical laws, Lagrangian Neural Networks can significantly improve the accuracy and generalization of dynamics models within Model-Based Reinforcement Learning.
By learning to predict blunders from Stockfish evaluations, OGSS enables chess agents to explore more aggressively without sacrificing tactical soundness.
By "imagining" new scenarios and asking "What if this were the true preference?", CRED actively designs environments and trajectories to expose differences between competing reward functions, dramatically improving preference learning.
Ditch the reactive agent: a Llama-2 model fine-tuned to infer semantic zones from object observations enables systematic exploration via TSP optimization, dramatically boosting ObjectNav performance.
By tokenizing trajectories into LLM-friendly point tokens and embeddings, AutoTraces unlocks SOTA long-horizon trajectory forecasting without manual annotation.
Fixed-altitude underwater vehicles can now efficiently search and sample sparse coral colonies thanks to a hierarchical planner that fuses acoustic and visual data.
Achieve a 40% jump in success rates on real-world contact-rich manipulation by intelligently scheduling force feedback into visual-motor policies.
Stripping away seemingly helpful information from agents' observations can actually *improve* the robustness of multi-agent coordination in communication-constrained environments.
Scaling test-time compute can dramatically improve the success rate of robot imitation learning, achieving up to 95% on complex manipulation tasks.
RAMBO's instability got you down? ROMI offers a robust, value-aware model learning approach with implicitly differentiable adaptive weighting that outperforms RAMBO and other SOTA methods in offline RL benchmarks.
A novel self-conditioned GAN learns trajectory forecasting without context, outperforming supervised methods on poorly labeled data by discovering behavioral modes in the discriminator's feature space.