cs.AI, cs.CV

SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models

arXiv:2604.11530v1 Announce Type: cross
Abstract: Vision-Language Models (VLM) have revolutionized multimodal learning by jointly processing visual and textual information. Yet, they face significant challenges due to the high computational and memory…