sim · medium · vision-robot · metric · varies

ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control

Description

We present ECHO, an edge-cloud framework for language-driven whole-body control of humanoid robots. A cloud-hosted diffusion-based text-to-motion generator synthesizes motion references from natural language instructions, while an edge-deployed reinforcement-learning tracker executes them in closed loop on the robot. The two modules are bridged by a compact, robot-native 38-dimensional motion representation that encodes joint angles, root planar velocity, root height, and a continuous 6D root o
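To make the 38-dimensional representation more concrete, the following is a minimal sketch of how one such frame might be unpacked on the edge side. The description above is cut off, so the layout here is an assumption rather than the authors' specification: the ordering of the fields, the count of 29 joint angles (chosen only so that 29 + 2 + 1 + 6 = 38), and the reading of the "continuous 6D" component as the common two-column rotation encoding recovered by Gram-Schmidt orthonormalization are all hypothetical, as are the names `unpack_frame` and `rot6d_to_matrix`.

```python
import math

# Hypothetical layout: the source states only the 38-dimensional total.
N_JOINTS = 29                      # assumption: 38 - 2 - 1 - 6
FRAME_DIM = N_JOINTS + 2 + 1 + 6   # joints + planar velocity + height + 6D rotation


def _normalize(v):
    """Return v scaled to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    if n == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / n for x in v]


def rot6d_to_matrix(r6):
    """Recover a 3x3 rotation matrix from a 6D vector via Gram-Schmidt.

    The 6D vector is treated as two 3D vectors; they are orthonormalized
    and a third axis is obtained from their cross product.
    """
    a1, a2 = list(r6[:3]), list(r6[3:6])
    b1 = _normalize(a1)
    d = sum(x * y for x, y in zip(b1, a2))
    b2 = _normalize([y - d * x for x, y in zip(b1, a2)])
    b3 = [
        b1[1] * b2[2] - b1[2] * b2[1],
        b1[2] * b2[0] - b1[0] * b2[2],
        b1[0] * b2[1] - b1[1] * b2[0],
    ]
    # Return rows of the matrix whose columns are b1, b2, b3.
    return [[b1[i], b2[i], b3[i]] for i in range(3)]


def unpack_frame(frame):
    """Split one motion frame into named parts (assumed field order)."""
    if len(frame) != FRAME_DIM:
        raise ValueError(f"expected {FRAME_DIM} values, got {len(frame)}")
    j = N_JOINTS
    return {
        "joint_angles": list(frame[:j]),
        "root_vel_xy": list(frame[j:j + 2]),
        "root_height": frame[j + 2],
        "root_rotation": rot6d_to_matrix(frame[j + 3:j + 9]),
    }
```

A vector-valued encoding like this is attractive for a generator-to-tracker interface because every field is a plain real number, and the Gram-Schmidt step turns any 6D output with two linearly independent halves into a valid rotation.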

Source

http://arxiv.org/abs/2603.16188v1