simmediumroboticsmetric · varies

Robust Quadruped Locomotion via Evolutionary Reinforcement Learning

Description

Deep reinforcement learning has recently achieved strong results in quadrupedal locomotion, yet policies trained in simulation often fail to transfer when the environment changes. Evolutionary reinforcement learning aims to address this limitation by combining gradient-based policy optimisation with population-driven exploration. This work evaluates four methods on a simulated walking task: DDPG, TD3, and two Cross-Entropy-based variants CEM-DDPG and CEM-TD3. All agents are trained on flat terra

Source

http://arxiv.org/abs/2604.07224v1