Alethia: A Foundational Encoder for Voice Deepfakes
arXiv:2605.00251v1 Announce Type: cross
Abstract: Existing voice deepfake detection and localization models rely heavily on representations extracted from speech foundation models (SFMs). However, downstream finetuning has now reached a state of dimin…