Find, Fix, Reason: Context Repair for Video Reasoning
arXiv:2604.16243v1 Announce Type: new
Abstract: Reinforcement learning has advanced video reasoning in large multi-modal models, yet dominant pipelines either rely on on-policy self-exploration, which plateaus at the model’s knowledge boundary, or hyb…