Author name: Yuhan Xie, Yuping Yan, Yunqi Zhao, Handing Wang, Yaochu Jin

STRONG-VLA: Decoupled Robustness Learning for Vision-Language-Action Models under Multimodal Perturbations

Yuhan Xie, Yuping Yan, Yunqi Zhao, Handing Wang, Yaochu Jin / April 15, 2026

arXiv:2604.10055v2 Announce Type: replace
Abstract: Despite their strong performance in embodied tasks, recent Vision-Language-Action (VLA) models remain highly fragile under multimodal perturbations, where visual corruption and linguistic noise joint…

cs.RO

Vision-Language-Action Model, Robustness, Multi-modal Learning, Robot Manipulation

Yuhan Xie, Yuping Yan, Yunqi Zhao, Handing Wang, Yaochu Jin / April 14, 2026

arXiv:2604.10055v1 Announce Type: new
Abstract: Despite their strong performance in embodied tasks, recent Vision-Language-Action (VLA) models remain highly fragile under multimodal perturbations, where visual corruption and linguistic noise jointly i…