Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry
arXiv:2602.03204v2 Announce Type: replace
Abstract: While Mixture-of-Experts (MoE) architectures define the state-of-the-art, their theoretical success is often attributed to heuristic efficiency rather than geometric expressivity. In this work, we pr…