StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation
arXiv:2603.28565v1 Announce Type: new
Abstract: Vision-language-action (VLA) models have demonstrated exceptional performance in natural language-driven perception and control. However, the high computational cost of VLA models poses significant effic…