cs.LG

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals

arXiv:2604.22271v1 Announce Type: new
Abstract: Large language models can detect their own errors and sometimes correct them without external feedback, but the underlying mechanisms remain unknown. We investigate this through the lens of second-order …