Mariano Fern\'andez M\'endez

Descriptor-Injected Cross-Modal Learning: A Systematic Exploration of Audio-MIDI Alignment via Spectral and Melodic Features

Mariano Fern\'andez M\'endez / April 14, 2026

arXiv:2604.10283v1 Announce Type: cross
Abstract: Cross-modal retrieval between audio recordings and symbolic music representations (MIDI) remains challenging because continuous waveforms and discrete event sequences encode different aspects of the sa…

Author name: Mariano Fern\'andez M\'endez

Descriptor-Injected Cross-Modal Learning: A Systematic Exploration of Audio-MIDI Alignment via Spectral and Melodic Features