cs.CV

Decoupled Similarity for Task-Aware Token Pruning in Large Vision-Language Models

arXiv:2604.11240v1
Abstract: Token pruning has emerged as an effective approach to reduce the substantial computational overhead of Large Vision-Language Models (LVLMs) by discarding less informative visual tokens while preserving p…