cs.LG

Sparse Concept Anchoring for Interpretable and Controllable Neural Representations

arXiv:2512.12469v3 Announce Type: replace
Abstract: We introduce Sparse Concept Anchoring, a method that biases latent space to position a targeted subset of concepts while allowing others to self-organize, using only minimal supervision (labels for