Psychological Concept Neurons: Can Neural Control Bias Probing and Shift Generation in LLMs?
arXiv:2604.11802v1 Announce Type: new
Abstract: Using psychological constructs such as the Big Five, large language models (LLMs) can imitate specific personality profiles and predict a user’s personality. While LLMs can exhibit behaviors consistent w…