ReXSonoVQA: A Video QA Benchmark for Procedure-Centric Ultrasound Understanding
arXiv:2604.10916v3 Announce Type: replace-cross
Abstract: Ultrasound acquisition requires skilled probe manipulation and real-time adjustments. Vision-language models (VLMs) could enable autonomous ultrasound systems, but existing benchmarks evaluate …