cs.CV

Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances

arXiv:2605.11616v1
Abstract: Functional affordance grounding requires more than recognizing an object: an agent must localize the specific region that supports an interaction, such as the handle to pull or the button to press. This …