Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations
arXiv:2506.09067v2 Announce Type: replace
Abstract: Generative medical vision-language models (Med-VLMs) are primarily designed to generate complex textual information (e.g., diagnostic reports) from multimodal inputs including vision modality (e.g., …