Exploration and Exploitation Errors Are Measurable for Language Model Agents
arXiv:2604.13151v1 Announce Type: new
Abstract: Language Model (LM) agents are increasingly used in complex open-ended decision-making tasks, from AI coding to physical AI. A core requirement in these settings is the ability to both explore the proble…