LLM-Guided Runtime Parameter Optimization for Energy-Efficient Model Inference
arXiv:2604.27032v1 Announce Type: cross
Abstract: Large Language Models (LLMs) have become an integral part of many real-world workflows. However, LLMs consume a lot of energy, which becomes a large concern in the scale of the demand for these tools. …