cs.AI

OpenDeepThink: Parallel Reasoning via Bradley–Terry Aggregation

arXiv:2605.15177v1 Announce Type: new
Abstract: Test-time compute scaling is a primary axis for improving LLM reasoning. Existing methods primarily scale depth by extending a single reasoning trace. Scaling breadth by sampling multiple candidates in p…