Don’t Lose Focus: Activation Steering via Key-Orthogonal Projections
arXiv:2605.06342v1 Announce Type: new
Abstract: Activation steering controls LLM behaviour towards target behaviour by intervening in internal representations, yet it often degrades reasoning and retrieval performance. We argue that a primary cause of…