Baochang Ren, Shuofei Qiao, Da Zheng, Huajun Chen, Ningyu Zhang

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Baochang Ren, Shuofei Qiao, Da Zheng, Huajun Chen, Ningyu Zhang / April 17, 2026

arXiv:2506.19807v4 Announce Type: replace-cross
Abstract: Large Language Models (LLMs), particularly slow-thinking models, often exhibit severe hallucination, outputting incorrect content due to an inability to accurately recognize knowledge boundarie…

Author name: Baochang Ren, Shuofei Qiao, Da Zheng, Huajun Chen, Ningyu Zhang

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality