Camouflage-aware Image-Text Retrieval via Expert Collaboration
arXiv:2604.01251v1 Announce Type: new
Abstract: Camouflaged scene understanding (CSU) has attracted significant attention due to its broad practical implications. However, in this field, robust image-text cross-modal alignment remains under-explored, …