cs.CL

How Value Induction Reshapes LLM Behaviour

arXiv:2605.07925v1 Announce Type: new
Abstract: Conversational Large Language Models are post-trained on language that expresses specific behavioural traits, such as curiosity, open-mindedness, and empathy, and values, such as helpfulness, harmlessnes…