SurgCheck: Do Vision-Language Models Really Look at Images in Surgical VQA?
arXiv:2605.01911v2 Announce Type: replace
Abstract: Purpose: Vision-language models (VLMs) have shown promising performance in surgical visual question answering (VQA). However, existing surgical VQA datasets often contain linguistic shortcuts, where …