A Representation-Level Assessment of Bias Mitigation in Foundation Models
arXiv:2604.08561v1 Announce Type: cross
Abstract: We investigate how successful bias mitigation reshapes the embedding space of encoder-only and decoder-only foundation models, offering an internal audit of model behaviour through representational ana…