ActDistill: General Action-Guided Self-Derived Distillation for Efficient Vision-Language-Action Models
arXiv:2511.18082v2 Announce Type: replace
Abstract: Recent Vision-Language-Action (VLA) models have shown impressive flexibility and generalization, yet their deployment in robotic manipulation remains limited by heavy computational overhead and infer…