Location-Aware Pretraining for Medical Difference Visual Question Answering
arXiv:2603.04950v2 Announce Type: replace
Abstract: Differential medical VQA models compare multiple images to identify clinically meaningful changes and rely on vision encoders to capture fine-grained visual differences that reflect radiologists’ com…