cs.AI, cs.CV

CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

arXiv:2602.00181v3 Announce Type: replace-cross
Abstract: Understanding camera dynamics is a fundamental pillar of video spatial intelligence. However, existing multimodal models predominantly treat this task as a black-box classification, often confu…