cs.AI, cs.CV

MedLVR: Latent Visual Reasoning for Reliable Medical Visual Question Answering

arXiv:2604.09757v1 Announce Type: cross
Abstract: Medical vision–language models (VLMs) have shown strong potential for medical visual question answering (VQA), yet their reasoning remains largely text-centric: images are encoded once as static conte…