cs.CV

Reinforcing 3D Understanding in Point-VLMs via Geometric Reward Credit Assignment

arXiv:2604.21160v1 Announce Type: new
Abstract: Point-Vision-Language Models promise to empower embodied agents with executable spatial reasoning, yet they frequently succumb to geometric hallucination where predicted 3D structures contradict the obse…