Semantic Alignment in Hyperbolic Space for Open-Vocabulary Semantic Segmentation
arXiv:2605.08874v1 Announce Type: new
Abstract: Open-vocabulary semantic segmentation requires adapting image-level vision-language models such as CLIP to dense pixel-level prediction, which is challenging due to the mismatch between hierarchical stru…