Multiple Consistent 2D-3D Mappings for Robust Zero-Shot 3D Visual Grounding
arXiv:2604.26261v1
Abstract: Zero-shot 3D Visual Grounding (3DVG) is a critical capability for open-world embodied AI. However, existing methods are fundamentally bottlenecked by the poor quality of open-vocabulary 3D proposals, suf…