Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents
arXiv:2601.21699v3 Announce Type: replace
Abstract: Multi-turn reasoning agents solve complex questions by decomposing them into intermediate retrieval or tool-use steps, accumulating supporting evidence across turns. Meanwhile, with reinforcement…