Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning
arXiv:2605.09270v1 Announce Type: cross
Abstract: Supervised Fine-Tuning (SFT) is widely used for task-specific adaptation, yet recent work shows it systematically undermines reasoning generalization. We argue the root cause is not memorization itself…