cs.AI

SCRIBE: Structured Mid-Level Supervision for Tool-Using Language Models

arXiv:2601.03555v2 Announce Type: replace
Abstract: Training reliable tool-augmented agents remains a significant challenge, largely due to the difficulty of credit assignment in multi-step reasoning. While process-level reward models offer a promisin…