EnergyLens: Interpretable Closed-Form Energy Models for Multimodal LLM Inference Serving
arXiv:2605.10556v2 Announce Type: replace
Abstract: As large language models span dense, mixture-of-experts, and state-space architectures and are deployed on heterogeneous accelerators under increasingly diverse multimodal workloads, optimising infer…