See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model
arXiv:2605.11817v2 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models have shown remarkable promise in robotics manipulation, yet their high computational cost hinders real-time deployment. Existing token pruning methods suffer from …