cs.AI, cs.CL, cs.LG

Beyond Linear Steering: Unified Multi-Attribute Control for Language Models

arXiv:2505.24535v3 Announce Type: replace-cross
Abstract: Controlling multiple behavioral attributes in large language models (LLMs) at inference time is a challenging problem due to interference between attributes and the limitations of linear steeri…