cs.AI, cs.LG

Selecting Decision-Relevant Concepts in Reinforcement Learning

arXiv:2604.04808v1 Announce Type: cross
Abstract: Training interpretable concept-based policies requires practitioners to manually select which human-understandable concepts an agent should reason with when making sequential decisions. This selection …