simmediumsim-to-realmetric · varies

Tune to Learn: How Controller Gains Shape Robot Policy Learning

Description

Position controllers have become the dominant interface for executing learned manipulation policies. Yet a critical design decision remains understudied: how should we choose controller gains for policy learning? The conventional wisdom is to select gains based on desired task compliance or stiffness. However, this logic breaks down when controllers are paired with state-conditioned policies: effective stiffness emerges from the interplay between learned reactions and control dynamics, not from

Source

http://arxiv.org/abs/2604.02523v1