sim · medium · humanoid · metric · varies
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
Description
Existing humanoid control systems often rely on teleoperation or modular generation pipelines that separate language understanding from physical execution. However, the former is entirely human-driven, and the latter lacks tight alignment between language commands and physical behaviors. In this paper, we present SENTINEL, a fully end-to-end language-action model for humanoid whole-body control. We construct a large-scale dataset by tracking human motions in simulation using a pretrained whole b