cs.AI, cs.CL

LoVeC: Reinforcement Learning for Better Verbalized Confidence in Long-Form Generations

arXiv:2505.23912v2 Announce Type: replace-cross
Abstract: Hallucination remains a major challenge for the safe and trustworthy deployment of large language models (LLMs) in factual content generation. Prior work has explored confidence estimation as a…