← Back to Benchmarks
simmediumhumanoidmetric · varies

PCHC: Enabling Preference Conditioned Humanoid Control via Multi-Objective Reinforcement Learning

Description

Humanoid robots often need to balance competing objectives, such as maximizing speed while minimizing energy consumption. While current reinforcement learning (RL) methods can master complex skills like fall recovery and perceptive locomotion, they are constrained by fixed weighting strategies that produce a single suboptimal policy, rather than providing a diverse set of solutions for sophisticated multi-objective control. In this paper, we propose a novel framework leveraging Multi-Objective R

Source

http://arxiv.org/abs/2603.24047v1