Verifier-Backed Hard Problem Generation for Mathematical Reasoning
arXiv:2605.06660v1 Announce Type: cross
Abstract: Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, and novel problems – an essential compone…