Search papers, labs, and topics across Lattice.
This paper introduces Flow Reversal Steering (FRS), a novel method that enhances robotic generalist policies by leveraging latent noise from suboptimal actions to guide the selection of more effective actions. By transforming coarse semantic guidance from humans or vision-language models into actionable robot commands, FRS significantly improves zero-shot control capabilities. The approach demonstrates remarkable efficacy, achieving up to a 95% increase in task success rates with minimal training time and enabling better performance in reinforcement learning scenarios where traditional methods struggle.
Flow Reversal Steering transforms vague human commands into precise robotic actions, achieving up to 95% higher success rates in real-world tasks with minimal training.
Generalist policies can learn a wide range of skills from diverse robot datasets. In order to solve or improve on challenging news tasks, we need a way to infer and invoke the appropriate actions from the policy's rich behavioral prior, especially when directly commanding the policy fails. We focus on flow matching generalists and propose Flow Reversal Steering (FRS): a method that takes suboptimal but ``reasonable''actions, finds their latent noises by passing them through the flow policy in reverse, and maps them to nearby generalist action modes. We evaluate FRS across many simulated and real-world manipulation settings. First, FRS can turn coarse semantic guidance from humans or vision-language models (VLMs) into corresponding good robot actions, improving zero-shot control. These gains can be distilled with behavioral cloning by training an auxiliary policy to output noises that the generalist maps to good actions -- showing up to 95% absolute task success rate boosts in under a minute of training. Finally, FRS enables policy improvement by bootstrapping reinforcement learning with semantic knowledge, improving on several tasks that standard RL fails to improve on.