RL Token: Bootstrapping Online RL with Vision-Language-Action Models
arXiv:2604.23073v2 Announce Type: replace-cross
Abstract: Vision-language-action (VLA) models can learn to perform diverse manipulation skills “out of the box,” but achieving the precision and speed that real-world tasks demand requires further fine-t…