Muyang Li, Yucheng Liu, Jianbo Ma, Elliot Osborne, Bo Han, Tongliang Liu

Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance

Muyang Li, Yucheng Liu, Jianbo Ma, Elliot Osborne, Bo Han, Tongliang Liu / May 5, 2026

arXiv:2605.01325v1 Announce Type: new
Abstract: Vision-Language Models (VLMs) have enhanced traditional LLMs with visual capabilities through the integration of vision encoders. While recent works have explored various combinations of vision encoders …

Author name: Muyang Li, Yucheng Liu, Jianbo Ma, Elliot Osborne, Bo Han, Tongliang Liu

Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance