KIRA: Knowledge-Intensive Image Retrieval and Reasoning Architecture for Specialized Visual Domains
arXiv:2604.16915v1 Announce Type: new
Abstract: Retrieval-augmented generation (RAG) has transformed text-based question answering, yet its extension to visual domains remains hindered by fundamental challenges: bridging the modality gap between image…