OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport
arXiv:2602.20205v3 Announce Type: replace
Abstract: Multi-modal large language models (MLLMs) achieve strong visual-language reasoning but suffer from high inference cost due to redundant visual tokens. Recent work explores visual token pruning to acc…