Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings
arXiv:2605.03079v1 Announce Type: cross
Abstract: Recent advances in emotional voice conversion (EVC) have enabled the generation of expressive synthetic speech, raising new concerns in audio deepfake detection. Existing approaches treat speech as a h…