cs.AI, cs.CV

ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization

arXiv:2410.10238v3 Announce Type: replace
Abstract: Multimodal Large Language Models (MLLMs), such as GPT-4o, have shown strong capabilities in visual reasoning and explanation generation. Despite these strengths, they face significant challen…