cs.CL

Detoxification for LLM: From Dataset Itself

arXiv:2604.19124v1 Announce Type: new
Abstract: Existing detoxification methods for large language models mainly focus on post-training stage or inference time, while few tackle the source of toxicity, namely, the dataset itself. Such training-based o…