cs.MA, cs.RO

Assessing VLM-Driven Semantic-Affordance Inference for Non-Humanoid Robot Morphologies

arXiv:2604.19509v1 Announce Type: new
Abstract: Vision-language models (VLMs) have demonstrated remarkable capabilities in understanding human-object interactions, but their application to robotic systems with non-humanoid morphologies remains largely…