Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
arXiv:2505.20825v2 Announce Type: replace
Abstract: Long-form question answering (LFQA) requires open-ended long-form responses that synthesize coherent, factually grounded content from multi-source evidence. This makes reinforcement learning (RL) rew…