cs.CL

Towards Reward Modeling for AI Tutors in Math Mistake Remediation

arXiv:2603.24375v1 Announce Type: new
Abstract: Evaluating the pedagogical quality of AI tutors remains challenging: standard NLG metrics do not capture whether responses identify mistakes, scaffold reasoning, or avoid revealing the answer. For the…