cs.CV

Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision

arXiv:2603.27179v1 Announce Type: new
Abstract: Multimodal large language models (MLLMs) have recently demonstrated remarkable reasoning and perceptual abilities for anomaly detection. However, most approaches remain confined to image-level anomaly de…