cs.LG

The Linear Centroids Hypothesis: How Deep Network Features Represent Data

arXiv:2604.11962v1 Announce Type: new
Abstract: Identifying and understanding the features that a deep network (DN) extracts from its inputs to produce its outputs is a focal point of interpretability research. The Linear Representation Hypothesis (LR…