Mean-field limit from general mixtures of experts to quantum neural networks
arXiv:2501.14660v2 Announce Type: replace-cross
Abstract: In this work, we study the asymptotic behavior of Mixture of Experts (MoE) trained via gradient flow on supervised learning problems. Our main result establishes the propagation of chaos for a …