cs.IT, cs.LG, math.IT

Expert Routing for Communication-Efficient MoE via Finite Expert Banks

arXiv:2605.05278v1 Announce Type: new
Abstract: Resource-efficient machine learning increasingly uses sparse Mixture-of-Experts (MoE) architectures, where the gate acts as both a learning component and a routing interface controlling computation, comm…