cs.AI, cs.LG

Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models

arXiv:2604.08588v1 Announce Type: new
Abstract: Effective automation hinges on deciding when to act and when to escalate. We model this as a decision under uncertainty: an LLM forms a prediction, estimates its probability of being correct, and compare…