sim · medium · mobile-manipulation · metric: varies

HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations

Description

We present Whole-Body Mobile Manipulation Interface (HoMMI), a data collection and policy learning framework that learns whole-body mobile manipulation directly from robot-free human demonstrations. We augment UMI interfaces with egocentric sensing to capture the global context required for mobile manipulation, enabling portable, robot-free, and scalable data collection. However, naively incorporating egocentric sensing introduces a larger human-to-robot embodiment gap in both observation and ac

Source

http://arxiv.org/abs/2603.03243v1