Daniel Ranard - Provide.ai

Likelihood scoring for continuations of mathematical text: a self-supervised benchmark with tests for shortcut vulnerabilities

Daniel Ranard / May 12, 2026

arXiv:2605.10810v1 Announce Type: new
Abstract: We introduce an automatically generated benchmark for predicting hidden text in technical papers. A paper supplies visible context $X$ and a hidden continuation $Y$; the evaluated model writes an auxilia…
