Failure Modes in Multi-Hop QA: The Weakest Link Effect and the Recognition Bottleneck
arXiv:2601.12499v2 Announce Type: replace-cross
Abstract: Despite scaling to massive context windows, Large Language Models (LLMs) struggle with multi-hop reasoning due to inherent position bias, which causes them to overlook information at certain po…