simmediumpolicy-learningmetric · varies

Physics-Informed Policy Optimization via Analytic Dynamics Regularization

Description

Reinforcement learning (RL) has achieved strong performance in robotic control; however, state-of-the-art policy learning methods, such as actor-critic methods, still suffer from high sample complexity and often produce physically inconsistent actions. This limitation stems from neural policies implicitly rediscovering complex physics from data alone, despite accurate dynamics models being readily available in simulators. In this paper, we introduce a novel physics-informed RL framework, called

Source

http://arxiv.org/abs/2603.14469v2