Beyond “I cannot fulfill this request”: Alleviating Rigid Rejection in LLMs via Label Enhancement
arXiv:2605.07883v1 Announce Type: new
Abstract: Large Language Models (LLMs) rely on safety alignment to comply with safe requests while refusing harmful ones. However, traditional refusal mechanisms often lead to “rigid rejection,” where a general template …