Sandy Fraser, Patryk Wielopolski

Sparse Concept Anchoring for Interpretable and Controllable Neural Representations

Sandy Fraser, Patryk Wielopolski / April 28, 2026

arXiv:2512.12469v3
Abstract: We introduce Sparse Concept Anchoring, a method that biases latent space to position a targeted subset of concepts while allowing others to self-organize, using only minimal supervision (labels for
