← Back to Benchmarks
simmediumhumanoidmetric · varies
PCHC: Enabling Preference Conditioned Humanoid Control via Multi-Objective Reinforcement Learning
Description
Humanoid robots often need to balance competing objectives, such as maximizing speed while minimizing energy consumption. While current reinforcement learning (RL) methods can master complex skills like fall recovery and perceptive locomotion, they are constrained by fixed weighting strategies that produce a single suboptimal policy, rather than providing a diverse set of solutions for sophisticated multi-objective control. In this paper, we propose a novel framework leveraging Multi-Objective R