AI Safety at the Frontier: Paper Highlights of February & March 2026
tl;drPaper of the month:A benchmark of 56 model organisms with hidden behaviors finds that auditing-tool rankings depend heavily on how the organism was trained — and the investigator agent, not the tools, is the bottleneck.Research highlights:Linear “…