simmediumlocomotionmetric · varies

RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion

Description

We propose a contact-explicit hierarchical architecture coupling Reinforcement Learning (RL) and Model Predictive Control (MPC), where a high-level RL agent provides gait and navigation commands to a low-level locomotion MPC. This offloads the combinatorial burden of contact timing from the MPC by learning acyclic gaits through trial and error in simulation. We show that only a minimal set of rewards and limited tuning are required to obtain effective policies. We validate the architecture in si

Source

http://arxiv.org/abs/2603.10878v1