KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
arXiv:2506.19807v4 Announce Type: replace-cross
Abstract: Large Language Models (LLMs), particularly slow-thinking models, often exhibit severe hallucination, outputting incorrect content due to an inability to accurately recognize knowledge boundarie…