sim · medium · locomotion · metric · varies

Trajectory Entropy Reinforcement Learning for Predictable and Robust Control

Description

Simplicity is a critical inductive bias for designing data-driven controllers, especially when robustness is important. Despite the impressive results of deep reinforcement learning in complex control tasks, it is prone to capturing intricate and spurious correlations between observations and actions, leading to failure under slight perturbations to the environment. To tackle this problem, in this work we introduce a novel inductive bias towards simple policies in reinforcement learning. The sim

Source

http://arxiv.org/abs/2505.04193v1