The paper introduces Hy-Embodied-0.5-VLA (HyVLA-0.5), an integrated system that covers the entire robot learning pipeline, from data collection to real-world deployment. The system is designed to make vision-language-action models more capable in practical robotic applications. Reported results show improved performance and adaptability when moving from simulated environments to real-world tasks.
A fully integrated robot learning stack that bridges the gap between simulation and real-world deployment, improving the effectiveness of vision-language-action models.
In this report, we present Hy-Embodied-0.5-VLA, abbreviated as HyVLA-0.5, an end-to-end system that spans the full robot learning stack: data collection, model design, continued pre-training and supervised fine-tuning, RL post-training, and real-world deployment. Each component serves a distinct role in this stack.