Huije Lee, Jisu Shin, Hoyun Song, Changgeon Ko, Jong C. Park

Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation

Huije Lee, Jisu Shin, Hoyun Song, Changgeon Ko, Jong C. Park / April 21, 2026

arXiv:2604.17020v1 Announce Type: new
Abstract: Static benchmarks for harmful content detection face limitations in scalability and diversity, and may also be affected by contamination from web-scale pre-training corpora. To address these issues, we p…

Author name: Huije Lee, Jisu Shin, Hoyun Song, Changgeon Ko, Jong C. Park

Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation