Au-M-ol: A Unified Model for Medical Audio and Language Understanding
arXiv:2604.23284v1 Announce Type: new
Abstract: In this work, we present Au-M-ol, a novel multimodal architecture that extends Large Language Models (LLMs) with audio processing. It is designed to improve performance on clinically relevant tasks such …