cs.CV

Visual Instruction-Finetuned Language Model for Versatile Brain MR Image Tasks

arXiv:2604.02748v1 Announce Type: new
Abstract: LLMs have demonstrated remarkable capabilities in linguistic reasoning and are increasingly adept at vision-language tasks. The integration of image tokens into transformers has enabled direct visual inp…