Medical thinking with multiple images
arXiv:2604.16506v2 Announce Type: replace-cross
Abstract: Large language models perform well on many medical QA benchmarks, but real clinical reasoning often requires integrating evidence across multiple images rather than interpreting a single view. …