cs.AI, cs.CL, cs.LG

LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

arXiv:2602.09924v3 Announce Type: replace
Abstract: Running LLMs with extended reasoning on every problem is expensive, but determining which inputs actually require additional compute remains challenging. We investigate whether their own likelihood o…