Control protocols don’t always need to know which models are scheming
These are my personal views. To detect if an agent is taking a catastrophically dangerous action, you might want to monitor its actions using the smartest model that is too weak to be a schemer. But knowing what models are weak enough that they are unli…