Akash Kundu, Sebastian Feld

Replay-buffer engineering for noise-robust quantum circuit optimization

Akash Kundu, Sebastian Feld / April 24, 2026

arXiv:2604.21863v1 Announce Type: cross
Abstract: Deep reinforcement learning (RL) for quantum circuit optimization faces three fundamental bottlenecks: replay buffers that ignore the reliability of temporal-difference (TD) targets, curriculum-based a…

Author name: Akash Kundu, Sebastian Feld

Replay-buffer engineering for noise-robust quantum circuit optimization