cs.CL, cs.CY

SafeMath: Inference-time Safety improves Math Accuracy

arXiv:2603.25201v1 Announce Type: new
Abstract: Recent research points toward LLMs being manipulated through adversarial and seemingly benign inputs, resulting in harmful, biased, or policy-violating outputs. In this paper, we study an underexplored i…