Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols
arXiv:2512.02787v3 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models have recently achieved remarkable progress in robotic manipulation, yet they remain limited in failure diagnosis and learning from failures. Additionally, existing…