Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization
arXiv:2605.01482v1 Announce Type: new
Abstract: Multi-Hop Fact Verification (MHFV) requires complex reasoning across disparate pieces of evidence, posing significant challenges for Large Language Models (LLMs), which often suffer from hallucinations and frac…