cs.CV

Thinking with Geometry: Active Geometry Integration for Spatial Reasoning

arXiv:2602.06037v3 Announce Type: replace
Abstract: Recent progress in spatial reasoning with Multimodal Large Language Models (MLLMs) increasingly leverages geometric priors from 3D encoders. However, most existing integration strategies remain passi…