Steering Code LLMs with Activation Directions for Language and Library Control
arXiv:2603.23629v2 Announce Type: replace
Abstract: Code LLMs often default to particular programming languages and libraries under neutral prompts. We investigate whether these preferences are encoded as approximately linear directions in activation …