← Back to Benchmarks
simmediumhumanoidmetric · varies
Task-Specified Compliance Bounds for Humanoids via Lipschitz-Constrained Policies
Description
Reinforcement learning (RL) has demonstrated substantial potential for humanoid bipedal locomotion and the control of complex motions. To cope with oscillations and impacts induced by environmental interactions, compliant control is widely regarded as an effective remedy. However, the model-free nature of RL makes it difficult to impose task-specified and quantitatively verifiable compliance objectives, and classical model-based stiffness designs are not directly applicable. Lipschitz-Constraine