Svetoslav Nizhnichenkov, Rahul Nair, Elizabeth Daly, Brian Mac Namee

A Representation-Level Assessment of Bias Mitigation in Foundation Models

April 13, 2026

arXiv:2604.08561v1 Announce Type: cross
Abstract: We investigate how successful bias mitigation reshapes the embedding space of encoder-only and decoder-only foundation models, offering an internal audit of model behaviour through representational ana…
