Justn williams, Kishor Datta Gupta, Roy George, Mrinmoy Sarkar

LiteVLA-H: Dual-Rate Vision-Language-Action Inference for Onboard Aerial Guidance and Semantic Perception

Justn williams, Kishor Datta Gupta, Roy George, Mrinmoy Sarkar / May 5, 2026

arXiv:2605.00884v1 Announce Type: new
Abstract: Vision-language-action (VLA) models have shown strong semantic grounding and task generalization in manipulation, but aerial deployment remains difficult because drones require low-latency closed-loop gu…

Author name: Justn williams, Kishor Datta Gupta, Roy George, Mrinmoy Sarkar

LiteVLA-H: Dual-Rate Vision-Language-Action Inference for Onboard Aerial Guidance and Semantic Perception