cs.CV

Attention Misses Visual Risk: Risk-Adaptive Steering for Multimodal Safety Alignment

arXiv:2510.13698v3 Announce Type: replace
Abstract: Even modern AI models often remain vulnerable to multimodal queries in which harmful intent is embedded in images. A widely used approach for safety alignment is training with extensive multimodal sa…