Principal Prototype Analysis on Manifold for Interpretable Reinforcement Learning
arXiv:2603.27971v2 Announce Type: replace
Abstract: Recent years have witnessed the widespread adoption of reinforcement learning (RL), from solving real-time games to fine-tuning large language models using human preference data significantly improvi…