cs.CL

MedPRMBench: A Fine-grained Benchmark for Process Reward Models in Medical Reasoning

arXiv:2604.17282v1 Announce Type: new
Abstract: Process-Level Reward Models (PRMs) are essential for guiding complex reasoning in large language models, yet existing PRM benchmarks cover only general domains such as mathematics, failing to address med…