cs.CV

Channel Attention-Guided Cross-Modal Knowledge Distillation for Referring Image Segmentation

arXiv:2604.16806v1 Announce Type: new
Abstract: Referring image segmentation (RIS) requires accurate segmentation of target regions in images according to language descriptions, which is a cross-modal task integrating vision and language. Existing RIS…