Seed2Scale is introduced as a self-evolving data engine for embodied AI, using a synergistic approach of small-model data collection, large-model evaluation, and target-model learning. A lightweight Vision-Language-Action model (SuperTiny) collects data, while a pre-trained Vision-Language Model acts as a Verifier for autonomous quality scoring. Experiments show Seed2Scale achieves a 131.2% performance improvement in the target model's success rate and outperforms existing data augmentation methods.
Forget expensive data collection: Seed2Scale leverages a small-model/large-model synergy to self-generate high-quality embodied AI training data, starting from just four seed demonstrations.
Existing data generation methods suffer from exploration limits, embodiment gaps, and low signal-to-noise ratios, leading to performance degradation during self-iteration. To address these challenges, we propose Seed2Scale, a self-evolving data engine that overcomes the data bottleneck through a heterogeneous synergy of "small-model collection, large-model evaluation, and target-model learning". Starting with as few as four seed demonstrations, the engine employs the lightweight Vision-Language-Action model, SuperTiny, as a dedicated collector, leveraging its strong inductive bias for robust exploration in parallel environments. Concurrently, a pre-trained Vision-Language Model is integrated as a Verifier to autonomously perform success/failure judgment and quality scoring for the massive generated trajectories. Seed2Scale effectively mitigates model collapse, ensuring the stability of the self-evolution process. Experimental results demonstrate that Seed2Scale exhibits significant scaling potential: as iterations progress, the success rate of the target model shows a robust upward trend, achieving a performance improvement of 131.2%. Furthermore, Seed2Scale significantly outperforms existing data augmentation methods, providing a scalable and cost-effective pathway for the large-scale development of Generalist Embodied AI. Project page: https://terminators2025.github.io/Seed2Scale.github.io
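The collect, verify, and learn loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `Trajectory`, `self_evolve`, `fit_collector`, and `verify`, the acceptance `threshold`, and the toy one-dimensional "trajectories" are all hypothetical stand-ins. In the real system the collector would be SuperTiny rolling out in parallel environments and the verifier a pre-trained Vision-Language Model.

```python
import random
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Trajectory:
    value: float        # toy stand-in for a full robot rollout (assumption)
    score: float = 1.0  # quality score; seed demonstrations are trusted


def self_evolve(
    seeds: List[Trajectory],
    fit_collector: Callable[[List[Trajectory]], Callable[[], Trajectory]],
    verify: Callable[[Trajectory], float],
    iterations: int = 3,
    rollouts: int = 16,
    threshold: float = 0.5,  # hypothetical acceptance cutoff
) -> List[Trajectory]:
    """Grow a dataset from seeds by repeated collection and filtering."""
    dataset = list(seeds)
    for _ in range(iterations):
        # 1. Small-model collection: refit the collector on current data.
        collector = fit_collector(dataset)
        candidates = [collector() for _ in range(rollouts)]
        # 2. Large-model evaluation: score every candidate trajectory.
        for traj in candidates:
            traj.score = verify(traj)
        # 3. Keep only trajectories the verifier rates highly enough, so
        #    low-quality rollouts do not pollute later iterations.
        dataset.extend(t for t in candidates if t.score >= threshold)
    return dataset


# Toy stand-ins so the sketch runs end to end (all assumptions).
rng = random.Random(0)


def toy_fit_collector(data: List[Trajectory]) -> Callable[[], Trajectory]:
    center = sum(t.value for t in data) / len(data)
    return lambda: Trajectory(value=center + rng.gauss(0.0, 0.5))


def toy_verify(traj: Trajectory) -> float:
    # Score falls off linearly with distance from an ideal value of 1.0.
    return max(0.0, 1.0 - abs(traj.value - 1.0))


seeds = [Trajectory(value=v) for v in (0.8, 0.9, 1.1, 1.2)]  # four seeds
dataset = self_evolve(seeds, toy_fit_collector, toy_verify)
```

The target model would then be trained on the returned `dataset`. Filtering by verifier score before the data is reused is what this sketch relies on to keep noisy rollouts from compounding across iterations, which is the failure mode the abstract calls model collapse.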