CorridorVLA: Explicit Spatial Constraints for Generative Action Heads via Sparse Anchors
arXiv:2604.21241v1 Announce Type: new
Abstract: Vision–Language–Action (VLA) models often use intermediate representations to connect multimodal inputs with continuous control, yet spatial guidance is frequently injected implicitly through latent feature…