cs.CL, cs.SD, eess.AS

Alethia: A Foundational Encoder for Voice Deepfakes

arXiv:2605.00251v1 Announce Type: cross
Abstract: Existing voice deepfake detection and localization models rely heavily on representations extracted from speech foundation models (SFMs). However, downstream finetuning has now reached a state of dimin…