Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
arXiv:2502.18816v2 Announce Type: replace
Abstract: Significant progress has been achieved on the improvement and downstream usages of the Contrastive Language-Image Pre-training (CLIP) vision-language model, while less attention is paid to the interp…