Search papers, labs, and topics across Lattice.
This paper introduces PHASOR, a novel framework that treats action embeddings as first-class representations rather than task-specific intermediates, addressing limitations in interpretability and transferability across humanoid robots. By leveraging the periodic nature of motion through a phase manifold and integrating pose information, the authors create a unified, embodiment-agnostic action embedding space that enhances policy learning. The results demonstrate significant improvements in cross-embodiment retrieval and performance on various downstream tasks, showcasing the framework's effectiveness in scalable robot policy learning.
A unified action embedding space across diverse humanoid robots boosts performance and interpretability, transforming how we approach robot policy learning.
Learning a good action embedding space is fundamental to scalable robot policy learning, yet existing methods treat action latents as task-specific intermediates rather than first-class representations. The resulting latents are unstructured, embodiment-specific, and weakly tied to motion semantics, limiting interpretability, controllability, and transferability across robots. We position the action embedding space itself as a first-class design target, with downstream policy quality emerging from representation quality. Exploiting motion's intrinsic periodicity, we factorize it into a phase manifold that captures cyclic structure via FFT-parametric coefficients, together with a pose branch that conditions the manifold on non-periodic configuration detail. Combined with motion-semantic distillation, this factorized structure yields a cross-embodiment motion manifold that is interpretable and embodiment-agnostic by design. Anchoring multiple humanoid robots to a shared human-pretrained manifold then produces a unified action embedding space across diverse platforms, achieving strong cross-embodiment retrieval and consistent gains on downstream robot tasks.