Soham Gadgil, Chris Lin, Su-In Lee

Where to Steer: Input-Dependent Layer Selection for Steering Improves LLM Alignment

Soham Gadgil, Chris Lin, Su-In Lee / April 7, 2026

arXiv:2604.03867v1
Abstract: Steering vectors have emerged as a lightweight and effective approach for aligning large language models (LLMs) at inference time, enabling modulation over model behaviors by shifting LLM representations…
