cs.AI, cs.DC

Characterizing Performance-Energy Trade-offs of Large Language Models in Multi-Request Workflows

arXiv:2604.09611v1 Announce Type: cross
Abstract: Large language models (LLMs) are increasingly used in applications forming multi-request workflows like document summarization, search-based copilots, and multi-agent programming. While these workflows…