cs.AI

Automated alignment is harder than you think

arXiv:2605.06390v1 Announce Type: new
Abstract: A leading proposal for aligning artificial superintelligence (ASI) is to use AI agents to automate an increasing fraction of alignment research as capabilities improve. We argue that, even when research …