cs.AI, cs.CL, cs.CV, cs.RO

Libra-VLA: Achieving Learning Equilibrium via Asynchronous Coarse-to-Fine Dual-System

arXiv:2604.24921v1 Announce Type: cross
Abstract: Vision-Language-Action (VLA) models are a promising paradigm for generalist robotic manipulation by grounding high-level semantic instructions into executable physical actions. However, prevailing appr…