cs.CL, cs.CR, cs.CV, cs.LG

Red-Teaming Text-to-Image Models via In-Context Experience Replay and Semantic-Preserving Prompt Rewriting

arXiv:2411.16769v3 Announce Type: replace-cross
Abstract: Understanding the capabilities of text-to-image (T2I) models in harmful content generation is essential to safety and compliance. However, human red-teaming is costly and inconsistent, driving …