Information-Theoretic Limits of Safety Verification for Self-Improving Systems
arXiv:2603.28650v2 Announce Type: replace
Abstract: Can a safety gate permit unbounded beneficial self-modification while maintaining bounded cumulative risk? We formalize this question through dual conditions — requiring sum delta_n < infinity (boun…