Towards Efficient Large Vision-Language Models: A Comprehensive Survey on Inference Strategies
arXiv:2603.27960v2 Announce Type: replace-cross
Abstract: Although Large Vision Language Models (LVLMs) have demonstrated impressive multimodal reasoning capabilities, their scalability and deployment are constrained by massive computational requireme…