cs.CV

RVLM: Recursive Vision-Language Models with Adaptive Depth

arXiv:2603.24224v1 Announce Type: new
Abstract: Medical AI systems face two fundamental limitations. First, conventional vision-language models (VLMs) perform single-pass inference, yielding black-box predictions that cannot be audited or explained in…