HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench
arXiv:2601.20255v2 Announce Type: replace-cross
Abstract: SWE-bench has emerged as the premier benchmark for evaluating Large Language Models on complex software engineering tasks. While these capabilities are fundamentally acquired during the mid-tra…