Decomposing and Steering Functional Metacognition in Large Language Models
arXiv:2605.08942v1 Announce Type: new
Abstract: Large language models (LLMs) increasingly exhibit behaviors suggesting awareness of their evaluation context, often adapting their reasoning strategies in benchmark settings. Prior work has shown that su…