cs.AI, cs.LG, cs.RO

AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models

arXiv:2511.14148v2 Announce Type: replace
Abstract: Vision-language-action (VLA) models have recently emerged as a powerful paradigm for building generalist robots. However, traditional VLA models that generate actions through flow matching (FM) typic…