ReasoningGuard: Safeguarding Large Reasoning Models with Inference-time Safety Aha Moments
arXiv:2508.04204v2
Abstract: Large Reasoning Models (LRMs) have demonstrated impressive performance in reasoning-intensive tasks, but they remain vulnerable to harmful content generation, particularly in the mid-to-late steps of…