Spectro-Temporal Modulation Representation Framework for Human-Imitated Speech Detection
arXiv:2604.23241v1 Announce Type: cross
Abstract: Human-imitated speech poses a greater challenge than AI-generated speech for both human listeners and automatic detection systems. Unlike AI-generated speech, which often contains artifacts, over-smoot…