Zhiyuan Xu, Joseph Gardiner, Sana Belguith, Lichao Wu

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs

Zhiyuan Xu, Joseph Gardiner, Sana Belguith, Lichao Wu / May 6, 2026

arXiv:2605.02946v1 Announce Type: new
Abstract: Safety alignment is critical for the responsible deployment of large language models (LLMs). As Mixture-of-Experts (MoE) architectures are increasingly adopted to scale model capacity, understanding thei…
