Towards Understanding, Analyzing, and Optimizing Agentic AI Execution: A CPU-Centric Perspective
arXiv:2511.00739v3 Announce Type: replace
Abstract: Agentic AI serving converts monolithic LLM-based inference to autonomous problem-solvers that can plan, call tools, perform reasoning, and adapt on the fly. Due to diverse task execution need, such s…