cs.AI, cs.CL

How Much Heavy Lifting Can an Agent Harness Do?: Measuring the LLM’s Residual Role in a Planning Agent

arXiv:2604.07236v3 Announce Type: replace-cross
Abstract: Agent harnesses — the stateful programs that wrap a language model and decide what it sees at each step — are now known to change end-to-end performance on a fixed model by as much as six tim…