A Layer-wise Analysis of Supervised Fine-Tuning
arXiv:2604.11838v1 Announce Type: cross
Abstract: While critical for alignment, Supervised Fine-Tuning (SFT) risks catastrophic forgetting, and how instruction-following capabilities emerge across layers remains poorly understood. We investigate…