Dynamic Execution Commitment of Vision-Language-Action Models
arXiv:2605.11567v1 Announce Type: new
Abstract: Vision-Language-Action (VLA) models predominantly adopt action chunking, i.e., predicting and committing to a short horizon of consecutive low-level actions in a single forward pass, to amortize the infe…
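As a rough illustration of action chunking as the abstract defines it, the sketch below runs a control loop in which one forward pass yields a fixed number of consecutive actions, all of which are executed before the policy is queried again. The names `toy_policy` and `run_chunked`, the scalar observation, and the additive dynamics are hypothetical stand-ins chosen for illustration; none of them come from the paper.

```python
from typing import Callable, List, Tuple

Observation = float
Action = float
Policy = Callable[[Observation, int], List[Action]]


def toy_policy(obs: Observation, horizon: int) -> List[Action]:
    """Hypothetical stand-in for one VLA forward pass.

    Returns `horizon` consecutive actions computed from a single observation.
    """
    # Every action in the chunk is based on the same (soon stale) observation.
    return [-0.1 * obs for _ in range(horizon)]


def run_chunked(
    policy: Policy, obs: Observation, total_steps: int, horizon: int
) -> Tuple[Observation, int]:
    """Execute `total_steps` control steps, querying the policy once per chunk."""
    forward_passes = 0
    step = 0
    while step < total_steps:
        chunk = policy(obs, horizon)  # one forward pass
        forward_passes += 1
        # Commit to the chunk: execute it without re-querying the policy.
        for action in chunk[: total_steps - step]:
            obs = obs + action  # toy environment dynamics (assumption)
            step += 1
    return obs, forward_passes


if __name__ == "__main__":
    _, passes_chunked = run_chunked(toy_policy, 1.0, total_steps=16, horizon=8)
    _, passes_single = run_chunked(toy_policy, 1.0, total_steps=16, horizon=1)
    print(passes_chunked, passes_single)
```

In this toy setting, 16 control steps need 2 forward passes with a horizon of 8 and 16 forward passes with a horizon of 1. The horizon is fixed here, so every action in a chunk is executed regardless of how the observation changes in the meantime.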