cs.LG

Entropy After </Think> for reasoning model early exiting

arXiv:2509.26522v3 Announce Type: replace
Abstract: Reasoning LLMs show improved performance with longer chains of thought. However, recent work has highlighted their tendency to overthink, continuing to revise answers even after reaching the correct …