cs.AI, cs.SD

ATIR: Towards Audio-Text Interleaved Contextual Retrieval

arXiv:2604.20267v1 Announce Type: cross
Abstract: Audio carries richer information than text, including emotion, speaker traits, and environmental context, and it can be processed with lower latency than speech-to-text pipelines allow. However, rec…