Zixuan Xie, Xinyu Liu, Rohan Chandra, Shangtong Zhang

Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought

Zixuan Xie, Xinyu Liu, Rohan Chandra, Shangtong Zhang / May 11, 2026

arXiv:2605.07123v1 Announce Type: new
Abstract: In-context reinforcement learning (ICRL) refers to the ability of RL agents to adapt to new tasks at inference time without parameter updates by conditioning on additional context. Recent empirical studi…

Author name: Zixuan Xie, Xinyu Liu, Rohan Chandra, Shangtong Zhang

Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought