Finetune Like You Pretrain: Boosting Zero-shot Adversarial Robustness in Vision-language Models
arXiv:2604.11576v1 Announce Type: new
Abstract: Despite their impressive zero-shot abilities, vision-language models such as CLIP have been shown to be susceptible to adversarial attacks. To enhance its adversarial robustness, recent studies finetune …