simmediumlocomotionmetric · varies

Non-conflicting Energy Minimization in Reinforcement Learning based Robot Control

Description

Efficient robot control often requires balancing task performance with energy expenditure. A common approach in reinforcement learning (RL) is to penalize energy use directly as part of the reward function. This requires carefully tuning weight terms to avoid undesirable trade-offs where energy minimization harms task success. In this work, we propose a hyperparameter-free gradient optimization method to minimize energy expenditure without conflicting with task performance. Inspired by recent wo

Source

http://arxiv.org/abs/2509.01765v1