Where Do Reasoning Models Refuse?
arXiv:2507.03167v4 Announce Type: replace-cross
Abstract: Chat models without chain-of-thought (CoT) reasoning must decide whether to refuse a harmful request before generating their first response token. Reasoning models, by contrast, produce extende…