TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models
arXiv:2603.24584v1 Announce Type: new
Abstract: Vision-Language-Action (VLA) policies have shown strong progress in mapping language instructions and visual observations to robotic actions, yet their reliability degrades in cluttered scenes with dis…