Matthew DosSantos DiSorbo, Harang Ju

Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models

Matthew DosSantos DiSorbo, Harang Ju / April 13, 2026

arXiv:2604.08588v1 Announce Type: new
Abstract: Effective automation hinges on deciding when to act and when to escalate. We model this as a decision under uncertainty: an LLM forms a prediction, estimates its probability of being correct, and compare…

Author name: Matthew DosSantos DiSorbo, Harang Ju

Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models