Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought
arXiv:2605.07123v1 Announce Type: new
Abstract: In-context reinforcement learning (ICRL) refers to the ability of RL agents to adapt to new tasks at inference time without parameter updates by conditioning on additional context. Recent empirical studi…