Vamshi Nallaguntla, Shruti Kshirsagar, Anderson R. Avila

Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings

May 6, 2026

arXiv:2605.03079v1 Announce Type: cross
Abstract: Recent advances in emotional voice conversion (EVC) have enabled the generation of expressive synthetic speech, raising new concerns in audio deepfake detection. Existing approaches treat speech as a h…
