Contextual Linear Activation Steering of Language Models
arXiv:2604.24693v1 Announce Type: new
Abstract: Linear activation steering is a powerful approach for eliciting the capabilities of large language models and specializing their behavior using limited labeled data. While effective, existing methods oft…