cs.AI, cs.CR

GUARD-SLM: Token Activation-Based Defense Against Jailbreak Attacks for Small Language Models

arXiv:2603.28817v1 Announce Type: cross
Abstract: Small Language Models (SLMs) are emerging as efficient and economically viable alternatives to Large Language Models (LLMs), offering competitive performance at significantly lower computational cost…