V\'ictor Gallego - Provide.ai

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

V\'ictor Gallego / April 28, 2026

arXiv:2604.23210v1 Announce Type: new
Abstract: Can large language model agents discover hidden safety objectives through experience alone? We introduce EPO-Safe (Experiential Prompt Optimization for Safe Agents), a framework where an LLM iteratively …

Author name: V\'ictor Gallego

Discovering Agentic Safety Specifications from 1-Bit Danger Signals