cs.RO

BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning

arXiv:2605.13382v1
Abstract: While autoregressive (AR) Vision-Language-Action (VLA) models have demonstrated formidable reasoning capabilities in robotic tasks, their sequential decoding process often incurs high inference latency a…