cs.CL, cs.LG

A Representation-Level Assessment of Bias Mitigation in Foundation Models

arXiv:2604.08561v1 Announce Type: cross
Abstract: We investigate how successful bias mitigation reshapes the embedding space of encoder-only and decoder-only foundation models, offering an internal audit of model behaviour through representational ana…