Reliable Control-Point Selection for Steering Reasoning in Large Language Models
arXiv:2604.02113v1 Announce Type: new
Abstract: Steering vectors offer a training-free mechanism for controlling reasoning behaviors in large language models, but constructing effective vectors requires identifying genuine behavioral signals in the mo…