Energy-Aware Routing to Large Reasoning Models
arXiv:2601.00823v2 Announce Type: replace
Abstract: Large reasoning models (LRMs) have heterogeneous inference energy costs based on which model is used and how much it reasons. To reduce energy, it is important to choose the right LRM and operate it …