ToxiTrace: Gradient-Aligned Training for Explainable Chinese Toxicity Detection
arXiv:2604.12321v1 Announce Type: new
Abstract: Existing Chinese toxic content detection methods mainly target sentence-level classification but often fail to provide readable and contiguous toxic evidence spans. We propose \textbf{ToxiTrace}, an expl…