Stanford HAIGeorgia TechMar 18, 2026arXiv:2603.17300

ReSteer: Quantifying and Refining the Steerability of Multitask Robot Policies

Zhenyang Chen, Alan Tian, Alan Tian, Liquan Wang, Liquang Wang, Benjamin Joffe, Benjamin Joffe, Yingyan Lin, Yingyan Celine Lin, Yuxiao Chen, Yuxiao Chen, Siddharth Karamcheti, Siddharth Karamcheti, Danfei Xu

AI Summary

The paper introduces ReSteer, a framework to quantify and improve task steerability in multitask robot policies, addressing the issue where robots fail to respond to new instructions mid-execution. ReSteer identifies low-steerability states using a novel estimator, synthesizes motion segments from these states with a steerable data generator, and refines the policy through a self-refinement pipeline. Experiments on LIBERO simulation and real-world scenarios demonstrate that ReSteer improves steerability by 11% and is critical for interactive use.

Key Contribution

Robots often ignore your commands mid-task, but ReSteer offers a way to fix this by pinpointing and patching the "blind spots" in their training data.

Abstract

Despite strong multi-task pretraining, existing policies often exhibit poor task steerability. For example, a robot may fail to respond to a new instruction ``put the bowl in the sink"when moving towards the oven, executing ``close the oven", even though it can complete both tasks when executed separately. We propose ReSteer, a framework to quantify and improve task steerability in multitask robot policies. We conduct an exhaustive evaluation of state-of-the-art policies, revealing a common lack of steerability. We find that steerability is associated with limited overlap among training task trajectory distributions, and introduce a proxy metric to measure this overlap from policy behavior. Building on this insight, ReSteer improves steerability via three components: (i) a steerability estimator that identifies low-steerability states without full-rollout evaluation, (ii) a steerable data generator that synthesizes motion segments from these states, and (iii) a self-refinement pipeline that improves policy steerability using the generated data. In simulation on LIBERO, ReSteer improves steerability by 11\% over 18k rollouts. In real-world experiments, we show that improved steerability is critical for interactive use, enabling users to instruct robots to perform any task at any time. We hope this work motivates further study on quantifying steerability and data collection strategies for large robot policies.

Eval Frameworks & Benchmarks Robotics & Embodied AI Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References28

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

ReSteer: Quantifying and Refining the Steerability of Multitask Robot Policies

Related Papers