Tanmay Gautam, Alireza Bahramali, Sandeep Atluri

AutoRISE: Agent-Driven Strategy Evolution for Red-Teaming Large Language Models

Tanmay Gautam, Alireza Bahramali, Sandeep Atluri / April 28, 2026

arXiv:2604.22871v1 Announce Type: cross
Abstract: Automated red-teaming methods for large language models typically optimize attack prompts within a fixed, human-designed strategy, leaving the attack strategy itself unchanged. We instead optimize the …
