← Back to Benchmarks
simmediumlocomotionmetric · varies
Spectral Normalization for Lipschitz-Constrained Policies on Learning Humanoid Locomotion
Description
Reinforcement learning (RL) has shown great potential in training agile and adaptable controllers for legged robots, enabling them to learn complex locomotion behaviors directly from experience. However, policies trained in simulation often fail to transfer to real-world robots due to unrealistic assumptions such as infinite actuator bandwidth and the absence of torque limits. These conditions allow policies to rely on abrupt, high-frequency torque changes, which are infeasible for real actuator