cs.RO

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models

arXiv:2604.18000v1 Announce Type: new
Abstract: Vision-Language-Action (VLA) models report impressive success rates on standard robotic benchmarks, fueling optimism about general-purpose physical intelligence. However, recent evidence suggests …