cs.CL, cs.LG

CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer

arXiv:2512.02711v2 Announce Type: replace-cross
Abstract: Ensuring content safety in large language models (LLMs) is essential for their deployment in real-world applications. However, existing safety guardrails are predominantly tailored for high-res…