Olli Järviniemi, Oliver Makins, Jacob Merizian, Robert Kirk, Ben Millwood

Propensity Inference: Environmental Contributors to LLM Behaviour

Olli Järviniemi, Oliver Makins, Jacob Merizian, Robert Kirk, Ben Millwood / April 24, 2026

arXiv:2604.21098v1 Announce Type: new
Abstract: Motivated by loss of control risks from misaligned AI systems, we develop and apply methods for measuring language models’ propensity for unsanctioned behaviour. We contribute three methodological improv…
