Scaling Cross-Environment Failure Reasoning Data for Vision-Language Robotic Manipulation
arXiv:2512.01946v3 Announce Type: replace-cross
Abstract: Robust robotic manipulation requires reliable failure detection and recovery. Although recent Vision-Language Models (VLMs) show promise in robot failure detection, their generalization is seve…