cs.CV, cs.LG, eess.AS

Mitigating Multimodal LLMs Hallucinations via Relevance Propagation at Inference Time

arXiv:2605.01766v1 Announce Type: new
Abstract: Multimodal large language models (MLLMs) have revolutionized the landscape of AI, demonstrating impressive capabilities in tackling complex vision and audio-language tasks. However, a critical challenge …