cs.CV

Audio-Visual Intelligence in Large Foundation Models

arXiv:2605.04045v1 Announce Type: new
Abstract: Audio-Visual Intelligence (AVI) has emerged as a central frontier in artificial intelligence, bridging auditory and visual modalities to enable machines that can perceive, generate, and interact in the m…

Scroll to Top