Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models
arXiv:2604.22631v1 Announce Type: new
Abstract: Modern automatic speech recognition (ASR) systems have been observed to perform better for some speaker groups (SGs) than for others, despite recent gains in overall performance. One potential impediment…