Surendra Pathak, Bo Han

Towards Efficient Large Vision-Language Models: A Comprehensive Survey on Inference Strategies

Surendra Pathak, Bo Han / April 14, 2026

arXiv:2603.27960v2 Announce Type: replace-cross
Abstract: Although Large Vision Language Models (LVLMs) have demonstrated impressive multimodal reasoning capabilities, their scalability and deployment are constrained by massive computational requireme…
