Switch-KD: Visual-Switch Knowledge Distillation for Vision-Language Models
arXiv:2604.14629v1 Announce Type: new
Abstract: Vision-Language Models (VLMs) have shown remarkable capabilities in joint vision-language understanding, but their large scale poses significant challenges for deployment in resource-constrained scenario…