Takes on Automating Alignment
Recently, AI models have become good at long-horizon tasks. They seem especially good at the kinds of long-horizon tasks that allow for a short feedback loop. For example, on MirrorCode, models were able to generate tens of thous…