← Back to Benchmarks
simmediumdexterousmetric · varies

ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation

Description

Vision-Language-Action (VLA) models have advanced general-purpose robotic manipulation by leveraging pretrained visual and linguistic representations. However, they struggle with contact-rich tasks that require fine-grained control involving force, especially under visual occlusion or dynamic uncertainty. To address these limitations, we propose ForceVLA, a novel end-to-end manipulation framework that treats external force sensing as a first-class modality within VLA systems. ForceVLA introduces

Source

http://arxiv.org/abs/2505.22159v3