CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning
arXiv:2603.08659v2 Announce Type: replace
Abstract: The emergence of large reasoning models demonstrates that scaling inference-time compute significantly enhances performance on complex tasks. However, it often falls into another trap: overthinking s…