cs.AI, cs.IT, cs.SY, eess.SY, math.IT

Energy-Aware Routing to Large Reasoning Models

arXiv:2601.00823v2 Announce Type: replace
Abstract: Large reasoning models (LRMs) have heterogeneous inference energy costs that depend on which model is used and how much it reasons. To reduce energy consumption, it is important to choose the right LRM and operate it …