keshavs - Provide.ai

Introspection Adapters: Training LLMs to Report Their Learned Behaviors

keshavs / April 28, 2026

Authors: Keshav Shenoy, Li Yang, Abhay Sheshadri, Soren Mindermann, Jack Lindsey, Sam Marks, and Rowan Wang

📄 Paper, 💻 Code, 🤖 Models

TL;DR: We introduce introspection adapters (IA), a technique for training an LLM to self-report behaviors it learned durin…
