Only Say What You Know: Calibration-Aware Generation for Long-Form Factuality
arXiv:2605.01749v1 Announce Type: new
Abstract: Large Reasoning Models achieve strong performance on complex tasks but remain prone to hallucinations, particularly in long-form generation where errors compound across reasoning steps. Existing approach…