Stanford HAIPIJun 11, 2026arXiv:2606.13675

Improving Robotic Generalist Policies via Flow Reversal Steering

Andy Tang, Andy Tang, William Chen, William Chen, Andrew Wagenmaker, Andrew Wagenmaker, Chelsea Finn, Chelsea Finn, Chelsea Finn, Sergey Levine, Sergey Levine

AI Summary

This paper introduces Flow Reversal Steering (FRS), a novel method that enhances robotic generalist policies by leveraging latent noise from suboptimal actions to guide the selection of more effective actions. By transforming coarse semantic guidance from humans or vision-language models into actionable robot commands, FRS significantly improves zero-shot control capabilities. The approach demonstrates remarkable efficacy, achieving up to a 95% increase in task success rates with minimal training time and enabling better performance in reinforcement learning scenarios where traditional methods struggle.

Key Contribution

Flow Reversal Steering transforms vague human commands into precise robotic actions, achieving up to 95% higher success rates in real-world tasks with minimal training.

Abstract

Generalist policies can learn a wide range of skills from diverse robot datasets. In order to solve or improve on challenging news tasks, we need a way to infer and invoke the appropriate actions from the policy's rich behavioral prior, especially when directly commanding the policy fails. We focus on flow matching generalists and propose Flow Reversal Steering (FRS): a method that takes suboptimal but ``reasonable''actions, finds their latent noises by passing them through the flow policy in reverse, and maps them to nearby generalist action modes. We evaluate FRS across many simulated and real-world manipulation settings. First, FRS can turn coarse semantic guidance from humans or vision-language models (VLMs) into corresponding good robot actions, improving zero-shot control. These gains can be distilled with behavioral cloning by training an auxiliary policy to output noises that the generalist maps to good actions -- showing up to 95% absolute task success rate boosts in under a minute of training. Finally, FRS enables policy improvement by bootstrapping reinforcement learning with semantic knowledge, improving on several tasks that standard RL fails to improve on.

Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References80

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Improving Robotic Generalist Policies via Flow Reversal Steering

Related Papers