Breaking the Computational Barrier: Provably Efficient Actor-Critic for Low-Rank MDPs
arXiv:2605.01242v1 Announce Type: new
Abstract: Reinforcement learning (RL) is a fundamental framework for sequential decision-making, in which an agent learns an optimal policy through interactions with an unknown environment. In settings with functi…