Expert Routing for Communication-Efficient MoE via Finite Expert Banks
arXiv:2605.05278v1 Announce Type: new
Abstract: Resource-efficient machine learning increasingly uses sparse Mixture-of-Experts (MoE) architectures, where the gate acts as both a learning component and a routing interface controlling computation, comm…
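
The gate's role as a routing interface can be illustrated with a generic top-k gate. The sketch below is not taken from the paper: it assumes ordinary softmax top-k routing over per-token expert logits, and the names `top_k_gate` and `route` are hypothetical. It shows only how the gate's output fixes which experts run and which tokens would be dispatched to each of them.

```python
import math


def top_k_gate(logits, k):
    """Pick the k highest-scoring experts for one token.

    Returns (expert_indices, weights), where the weights are a softmax
    taken over the selected logits only, so they sum to 1.
    """
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and the number of experts")
    # Indices of the k largest logits; ties go to the lower expert index.
    chosen = sorted(range(len(logits)), key=lambda i: (-logits[i], i))[:k]
    # Shift by the largest selected logit for numerical stability.
    peak = max(logits[i] for i in chosen)
    exps = [math.exp(logits[i] - peak) for i in chosen]
    total = sum(exps)
    return chosen, [e / total for e in exps]


def route(batch_logits, k):
    """Group tokens by expert.

    Returns a dict mapping each expert index to a list of
    (token_index, weight) pairs: the per-expert dispatch list.
    """
    num_experts = len(batch_logits[0])
    dispatch = {e: [] for e in range(num_experts)}
    for token, logits in enumerate(batch_logits):
        experts, weights = top_k_gate(logits, k)
        for e, w in zip(experts, weights):
            dispatch[e].append((token, w))
    return dispatch
```

In this sketch, experts with an empty dispatch list receive no tokens and do no work, which is the sense in which a sparse gate controls computation as well as routing.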