SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models
arXiv:2604.11530v1 Announce Type: cross
Abstract: Vision-Language Models (VLM) have revolutionized multimodal learning by jointly processing visual and textual information. Yet, they face significant challenges due to the high computational and memory…