simmediumquadrupedmetric · varies

Multi-Embodiment Locomotion at Scale with extreme Embodiment Randomization

Description

We present a single, general locomotion policy trained on a diverse collection of 50 legged robots. By combining an improved embodiment-aware architecture (URMAv2) with a performance-based curriculum for extreme Embodiment Randomization, our policy learns to control millions of morphological variations. Our policy achieves zero-shot transfer to unseen real-world humanoid and quadruped robots.

Source

http://arxiv.org/abs/2509.02815v1