Replay-buffer engineering for noise-robust quantum circuit optimization
arXiv:2604.21863v1 Announce Type: cross
Abstract: Deep reinforcement learning (RL) for quantum circuit optimization faces three fundamental bottlenecks: replay buffers that ignore the reliability of temporal-difference (TD) targets, curriculum-based a…