Sparse Concept Anchoring for Interpretable and Controllable Neural Representations
arXiv:2512.12469v3 Announce Type: replace
Abstract: We introduce Sparse Concept Anchoring, a method that biases latent space to position a targeted subset of concepts while allowing others to self-organize, using only minimal supervision (labels for