simmediumhumanoidmetric · varies

Learning Sim-to-Real Humanoid Locomotion in 15 Minutes

Description

Massively parallel simulation has reduced reinforcement learning (RL) training time for robots from days to minutes. However, achieving fast and reliable sim-to-real RL for humanoid control remains difficult due to the challenges introduced by factors such as high dimensionality and domain randomization. In this work, we introduce a simple and practical recipe based on off-policy RL algorithms, i.e., FastSAC and FastTD3, that enables rapid training of humanoid locomotion policies in just 15 minu

Source

http://arxiv.org/abs/2512.01996v1