Hao-Yuan Chen - Provide.ai

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Hao-Yuan Chen / April 24, 2026

arXiv:2604.21611v1 Announce Type: cross
Abstract: Inference-time scaling for LLM reasoning has focused on three axes: chain depth, sample breadth, and learned step-scorers (PRMs). We introduce a fourth axis, granularity of external verbal supervision,…

