simmediumhumanoidmetric · varies

Task-Specified Compliance Bounds for Humanoids via Lipschitz-Constrained Policies

Description

Reinforcement learning (RL) has demonstrated substantial potential for humanoid bipedal locomotion and the control of complex motions. To cope with oscillations and impacts induced by environmental interactions, compliant control is widely regarded as an effective remedy. However, the model-free nature of RL makes it difficult to impose task-specified and quantitatively verifiable compliance objectives, and classical model-based stiffness designs are not directly applicable. Lipschitz-Constraine

Source

http://arxiv.org/abs/2603.16180v2