StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning
arXiv:2605.03927v1 Announce Type: new
Abstract: Vision-language models (VLMs) have shown remarkable performance in various robotic tasks, as they can perceive visual information and understand natural language instructions. However, when applied to ro…