Search papers, labs, and topics across Lattice.
The paper introduces Trajectory Editing Residual Dataset Aggregation (TER-DAgger), a force-aware imitation learning framework for precision insertion tasks that addresses covariate shift by learning residual policies through optimization-based trajectory editing. TER-DAgger also incorporates a force-aware failure anticipation mechanism to minimize the need for continuous expert monitoring. Experiments on simulated and real-world precision insertion tasks demonstrate a 37% improvement in success rate compared to other imitation learning baselines when combined with Cartesian impedance control.
Achieve a 37% success rate boost in precision insertion by intelligently fusing human corrections with learned policies, triggered only when force discrepancies signal impending failure.
Imitation learning (IL) has shown strong potential for contact-rich precision insertion tasks. However, its practical deployment is often hindered by covariate shift and the need for continuous expert monitoring to recover from failures during execution. In this paper, we propose Trajectory Editing Residual Dataset Aggregation (TER-DAgger), a scalable and force-aware human-in-the-loop imitation learning framework that mitigates covariate shift by learning residual policies through optimization-based trajectory editing. This approach smoothly fuses policy rollouts with human corrective trajectories, providing consistent and stable supervision. Second, we introduce a force-aware failure anticipation mechanism that triggers human intervention only when discrepancies arise between predicted and measured end-effector forces, significantly reducing the requirement for continuous expert monitoring. Third, all learned policies are executed within a Cartesian impedance control framework, ensuring compliant and safe behavior during contact-rich interactions. Extensive experiments in both simulation and real-world precision insertion tasks show that TER-DAgger improves the average success rate by over 37\% compared to behavior cloning, human-guided correction, retraining, and fine-tuning baselines, demonstrating its effectiveness in mitigating covariate shift and enabling scalable deployment in contact-rich manipulation.