Search papers, labs, and topics across Lattice.
ForceVLA2, a vision-language-action framework, is introduced to enhance contact-rich manipulation by incorporating hybrid force-position control and explicit force awareness. The framework uses force-based prompts within a VLM expert to create force-aware task concepts and employs a Cross-Scale Mixture-of-Experts (MoE) to fuse these concepts with real-time interaction forces. Evaluated on a newly constructed dataset, ForceVLA2-Dataset, ForceVLA2 demonstrates significant improvements in success rates and reliability compared to baselines like pi0 and pi0.5.
Robots can now perform contact-rich tasks with significantly improved success rates (up to 48% better) by explicitly reasoning about and regulating interaction forces, thanks to a novel vision-language-action framework.
Embodied intelligence for contact-rich manipulation has predominantly relied on position control, while explicit awareness and regulation of interaction forces remain under-explored, limiting stability, precision, and robustness in real-world tasks. We propose ForceVLA2, an end-to-end vision-language-action framework that equips robots with hybrid force-position control and explicit force awareness. ForceVLA2 introduces force-based prompts into the VLM expert to construct force-aware task concepts across stages, and employs a Cross-Scale Mixture-of-Experts (MoE) in the action expert to adaptively fuse these concepts with real-time interaction forces for closed-loop hybrid force-position regulation. To support learning and evaluation, we construct ForceVLA2-Dataset, containing 1,000 trajectories over 5 contact-rich tasks, including wiping, pressing, and assembling, with multi-view images, task prompts, proprioceptive state, and force signals. Extensive experiments show that ForceVLA2 substantially improves success rates and reliability in contact-rich manipulation, outperforming pi0 and pi0.5 by 48.0% and 35.0%, respectively, across the 5 tasks, and mitigating common failure modes such as arm overload and unstable contact, thereby actively advancing force-aware interactive physical intelligence in VLAs. The project page is available at https://sites.google.com/view/force-vla2/home.