The paper introduces Style-Conditioned Diffusion Policy (SCDP), a framework that balances legibility and efficiency in robot motion by modulating a pre-trained diffusion policy based on environmental ambiguity. SCDP uses a lightweight scene encoder and conditioning predictor trained post-hoc to guide the diffusion process towards either legible or efficient trajectories. Experiments on manipulation and navigation tasks demonstrate that SCDP improves legibility in ambiguous scenarios while maintaining efficiency when legibility is not required, without retraining the base policy.
A lightweight module detects environmental ambiguity and modulates a frozen, pre-trained diffusion policy, letting a robot switch between legible and efficient motion at inference time without retraining the base policy.
Striking a balance between efficiency and transparent motion is a core challenge in human-robot collaboration, as highly expressive movements often incur unnecessary time and energy costs. In collaborative environments, legibility gives a human observer a better understanding of the robot's actions, increasing safety and trust. However, legible behaviors produce sub-optimal, exaggerated trajectories that are redundant in low-ambiguity scenarios where the robot's goal is already obvious. To address this trade-off, we propose Style-Conditioned Diffusion Policy (SCDP), a modular framework that steers the trajectory generation of a pre-trained diffusion model toward either legibility or efficiency based on the environment's configuration. Our method uses a post-training pipeline that freezes the base policy and trains a lightweight scene encoder and conditioning predictor to modulate the diffusion process. At inference time, an ambiguity detection module activates the appropriate conditioning, prioritizing expressive motion only for ambiguous goals and reverting to efficient paths otherwise. We evaluate SCDP on manipulation and navigation tasks. The results show that it enhances legibility in ambiguous settings while preserving efficiency when legibility is unnecessary, all without retraining the base policy.
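The abstract does not say how the ambiguity detection module is implemented. As a purely illustrative sketch of the inference-time gating idea, the Python snippet below assumes a 2D scene with known candidate goals and uses a hand-written heuristic: the scene counts as ambiguous when another goal lies in nearly the same direction as the target. The functions `ambiguity_score` and `select_style`, the angular heuristic, and the `threshold` value are all hypothetical and are not taken from the paper, where the conditioning is predicted by a learned scene encoder.

```python
import math


def ambiguity_score(start, goals, target_idx):
    """Hypothetical ambiguity measure in [0, 1].

    Uses the smallest angle between the direction to the target goal and
    the direction to any other candidate goal. A small angle means an
    observer cannot easily tell the goals apart, so the score is high.
    """
    sx, sy = start
    tx, ty = goals[target_idx]
    to_target = (tx - sx, ty - sy)

    min_angle = math.pi  # no competing goal -> maximally separated
    for i, (gx, gy) in enumerate(goals):
        if i == target_idx:
            continue
        to_goal = (gx - sx, gy - sy)
        cross = to_target[0] * to_goal[1] - to_target[1] * to_goal[0]
        dot = to_target[0] * to_goal[0] + to_target[1] * to_goal[1]
        angle = abs(math.atan2(cross, dot))  # unsigned angle in [0, pi]
        min_angle = min(min_angle, angle)

    return 1.0 - min_angle / math.pi


def select_style(start, goals, target_idx, threshold=0.8):
    """Pick the conditioning label to pass to the (frozen) policy.

    The threshold is an assumed value chosen for illustration only.
    """
    score = ambiguity_score(start, goals, target_idx)
    return "legible" if score >= threshold else "efficient"


if __name__ == "__main__":
    # Two goals almost in the same direction: ambiguous scene.
    print(select_style((0, 0), [(1, 0), (1, 0.1)], target_idx=0))
    # Two goals in opposite directions: the target is already obvious.
    print(select_style((0, 0), [(1, 0), (-1, 0)], target_idx=0))
```

In this sketch the returned label would stand in for the conditioning signal that selects between legible and efficient trajectory generation. The base policy itself is not modeled here.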