cs.AI

Student Guides Teacher: Weak-to-Strong Inference via Spectral Orthogonal Exploration

arXiv:2601.06160v2 Announce Type: replace
Abstract: Large Language Models (LLMs) often suffer from ”Reasoning Collapse” on challenging mathematical reasoning tasks, where stochastic sampling produces lexical variations of the same erroneous logic ra…