← Back to Benchmarks
simmediumrlmetric · varies

Selecting Decision-Relevant Concepts in Reinforcement Learning

Description

Training interpretable concept-based policies requires practitioners to manually select which human-understandable concepts an agent should reason with when making sequential decisions. This selection demands domain expertise, is time-consuming and costly, scales poorly with the number of candidates, and provides no performance guarantees. To overcome this limitation, we propose the first algorithms for principled automatic concept selection in sequential decision-making. Our key insight is that

Source

http://arxiv.org/abs/2604.04808v1