Search papers, labs, and topics across Lattice.
This paper introduces a novel framework for robot skill adaptation using Gaussian Processes (GPs) with sparse via-points to model trajectories, leveraging analytical derivatives to preserve the kinematic profile. Three adaptation methods are proposed: an optimization agent, a behavior cloning agent trained on the optimization agent's outputs, and a reinforcement learning agent that modifies via-points while maintaining the kinematic profile. Evaluations across three tasks in simulation and hardware demonstrate that the proposed GP-based methods outperform benchmarks in success rates, achieving high cosine similarity and low velocity magnitude errors, indicating strong kinematic profile preservation.
Robots can now adapt demonstrated skills to significantly different starting conditions thanks to a Gaussian Process representation that preserves kinematic profiles.
General robot skill adaptation requires expressive representations robust to varying task configurations. While recent learning-based skill adaptation methods refined via Reinforcement Learning (RL), have shown success, existing skill models often lack sufficient representational capacity for anything beyond minor environmental changes. In contrast, Gaussian Process (GP)-based skill modelling provides an expressive representation with useful analytical properties; however, adaptation of GP-based skills remains underexplored. This paper proposes a novel, robust skill adaptation framework that utilises GPs with sparse via-points for compact and expressive modelling. The model considers the trajectory's poses and leverages its first and second analytical derivatives to preserve the skill's kinematic profile. We present three adaptation methods to cater for the variability between initial and observed configurations. Firstly, an optimisation agent that adjusts the path's via-points while preserving the demonstration velocity. Second, a behaviour cloning agent trained to replicate output trajectories from the optimisation agent. Lastly, an RL agent that has learnt to modify via-points whilst maintaining the kinematic profile and enabling online capabilities. Evaluated across three tasks (drawer opening, cube-pushing and bar manipulation) in both simulation and hardware, our proposed methods outperform every benchmark in success rates. Furthermore, the results demonstrate that the GP-based representation enables all three methods to attain high cosine similarity and low velocity magnitude errors, indicating strong preservation of the kinematic profile. Overall, our formulation provides a compact representation capable of adapting to large deviations from a single demonstrated skill.