← Back to Benchmarks
simmediumquadrupedmetric · varies

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots

Description

Grounding natural-language instructions into continuous control for quadruped robots remains a fundamental challenge in vision language action. Existing methods struggle to bridge high-level semantic reasoning and low-level actuation, leading to unstable grounding and weak generalization in the real world. To address these issues, we present MobileVLA-R1, a unified vision-language-action framework that enables explicit reasoning and continuous control for quadruped robots. We construct MobileVLA

Source

http://arxiv.org/abs/2511.17889v1