Stanford HAIPurdueJun 17, 2026arXiv:2606.18589

DREAM-Chunk: Reactive Action Chunking with Latent World Model

Wenxi Chen, Kaidi Zhang, Chi Lin, Zhiyuan Zhang, Yu She, Yuejiang Liu, Raymond A. Yeh, Shaoshuai Mou, Yan Gu

AI Summary

This paper introduces DREAM-Chunk, a method that enhances action chunking in vision-language-action models by integrating a lightweight latent world model at test time. By sampling multiple candidate action chunks and predicting their latent futures, DREAM-Chunk effectively mitigates the brittleness of open-loop execution under stochastic conditions and hardware errors. Experimental results on the Kinetix benchmark and various manipulation tasks demonstrate significant improvements in robustness and reactivity, particularly in scenarios with high action noise and corrective behaviors in demonstrations.

Key Contribution

DREAM-Chunk transforms action chunking by leveraging latent world models to enhance robustness against stochastic dynamics without the need for policy retraining.

Abstract

Action chunking has become a common interface for vision-language-action (VLA) models, enabling low-frequency policy inference to drive high-frequency robot execution. However, once an action chunk is committed, its open-loop execution can be brittle under stochastic dynamics, hardware execution errors, and partial observability. We propose DREAM-Chunk, a test-time scaling method that augments chunking-based policies with a lightweight latent world model, without requiring additional policy fine-tuning. At test time, DREAM-Chunk samples multiple candidate action chunks, rolls out their predicted latent futures, and selects actions from the chunk whose predicted state best matches the observed rollout. In this way, DREAM-Chunk uses additional test-time computation to cover multiple plausible stochastic futures and improve reactivity during long-horizon chunk execution. On the Kinetix benchmark, DREAM-Chunk improves robustness under increasing action noise and benefits from larger candidate sample sizes, especially when demonstrations contain corrective behaviors. We further validate DREAM-Chunk on four manipulation tasks across two robot platforms and two VLA policies under various sources of stochasticity. Across simulation and hardware experiments, DREAM-Chunk improves the robustness of action-chunking policies in stochastic dynamics.

Multimodal Models Robotics & Embodied AI World Models & Planning

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

DREAM-Chunk: Reactive Action Chunking with Latent World Model

Related Papers