cs.AI

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

arXiv:2509.25758v2 Announce Type: replace
Abstract: The remarkable capabilities of modern large reasoning models are largely unlocked through post-training techniques such as supervised fine-tuning (SFT) and reinforcement learning (RL). However, the a…