TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints
arXiv:2605.13414v1
Abstract: Deploying language models as autonomous agents requires more than per-task accuracy: when an agent faces a queue of problems under a finite token budget, it must decide which to attempt, in what order, a…