Ivo Petrov, Jasper Dekoninck, Dimitar I. Dimitrov, Martin Vechev

Not All Proofs Are Equal: Evaluating LLM Proof Quality Beyond Correctness

May 12, 2026

arXiv:2605.10379v1
Abstract: Large language models (LLMs) have become capable mathematical problem-solvers, often producing correct proofs for challenging problems. However, correctness alone is not sufficient: mathematical proofs s…
