cs.AI, cs.LG

Towards Faster Language Model Inference Using Mixture-of-Experts Flow Matching

arXiv:2604.15009v1 Announce Type: cross
Abstract: Flow matching retains the generation quality of diffusion models while enabling substantially faster inference, making it a compelling paradigm for generative modeling. However, when applied to languag…