Hint Tuning: Less Data Makes Better Reasoners
arXiv:2605.08665v1 Announce Type: new
Abstract: Large reasoning models achieve high accuracy through extended chain-of-thought but generate 5–8 more tokens than necessary, applying verbose reasoning uniformly regardless of problem difficulty. We prop…