cs.CV

VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection

arXiv:2409.20146v2 Announce Type: replace
Abstract: Zero-shot anomaly detection (ZSAD) recognizes and localizes anomalies in previously unseen objects by establishing a feature mapping between textual prompts and inspection images, demonstrating excelle…