Author name: Jongmin Shin, Ka Young Kim, Eunki Cho, Seong Tae Kim, Namkee Oh

SurgCheck: Do Vision-Language Models Really Look at Images in Surgical VQA?

Jongmin Shin, Ka Young Kim, Eunki Cho, Seong Tae Kim, Namkee Oh / May 6, 2026

arXiv:2605.01911v2 Announce Type: replace
Abstract: Purpose: Vision-language models (VLMs) have shown promising performance in surgical visual question answering (VQA). However, existing surgical VQA datasets often contain linguistic shortcuts, where …

cs.CV

SurgCheck: Do Vision-Language Models Really Look at Images in Surgical VQA?

Jongmin Shin, Ka Young Kim, Eunki Cho, Seong Tae Kim, Namkee Oh / May 5, 2026

arXiv:2605.01911v1 Announce Type: new
Abstract: Purpose: Vision-language models (VLMs) have shown promising performance in surgical visual question answering (VQA). However, existing surgical VQA datasets often contain linguistic shortcuts, where ques…