Anubhab Sahu, Diptisha Samanta, Reza Soosahabi

Automated Framework to Evaluate and Harden LLM System Instructions against Encoding Attacks

Anubhab Sahu, Diptisha Samanta, Reza Soosahabi / April 2, 2026

arXiv:2604.01039v1 Announce Type: cross
Abstract: System Instructions in Large Language Models (LLMs) are commonly used to enforce safety policies, define agent behavior, and protect sensitive operational context in agentic AI applications. These inst…

Author name: Anubhab Sahu, Diptisha Samanta, Reza Soosahabi

Automated Framework to Evaluate and Harden LLM System Instructions against Encoding Attacks