cs.AI, cs.RO

Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving

arXiv:2604.01723v1 Announce Type: new
Abstract: Vision-Language-Action (VLA) models for autonomous driving must integrate diverse textual inputs, including navigation commands, hazard warnings, and traffic state descriptions, yet current systems often…