cs.AI, cs.HC

Value Alignment Tax: Measuring Value Trade-offs in LLM Alignment

arXiv:2602.12134v2 Announce Type: replace
Abstract: Existing work on value alignment typically characterizes value relations statically, ignoring how alignment interventions, such as prompting, fine-tuning, or preference optimization, reshape the broa…