Audio-to-Image Bird Species Retrieval without Audio-Image Pairs via Text Distillation
arXiv:2602.00681v2 Announce Type: replace-cross
Abstract: Audio-to-image retrieval offers an interpretable alternative to audio-only classification for bioacoustic species recognition, but learning aligned audio-image representations is challenging du…