Hierarchical Support Vector State Partitioning for Distilling Black Box Reinforcement Learning Policies
arXiv:2605.04254v1 Announce Type: new
Abstract: We introduce State Vector Space Partitioning (SVSP), a novel method to mimic a black-box reinforcement learning policy using a set of human-interpretable subpolicies. By partitioning a distillation datas…