Yann Berthelot, Philippe Preux, Riad Akrour

When (and How) to Trust the Expert: Diagnosing Query-Time Expert-Guided Reinforcement Learning

Yann Berthelot, Philippe Preux, Riad Akrour / May 12, 2026

arXiv:2605.09109v1 Announce Type: new
Abstract: Many continuous-control problems ship with a competent but suboptimal controller (a tuned PID, a hand-designed gait). A growing family of methods uses such controllers as queryable experts during RL, but…

Author name: Yann Berthelot, Philippe Preux, Riad Akrour

When (and How) to Trust the Expert: Diagnosing Query-Time Expert-Guided Reinforcement Learning