cs.AI, cs.CL, cs.LG

Parallel Test-Time Scaling for Latent Reasoning Models

arXiv:2510.07745v4 Announce Type: replace
Abstract: Parallel test-time scaling (TTS) is a pivotal approach for enhancing large language models (LLMs), typically by sampling multiple token-based chains-of-thought in parallel and aggregating outcomes th…
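
The token-based parallel TTS pattern that the abstract describes (sample several chains-of-thought in parallel, then aggregate their outcomes) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `sample_answer` is a hypothetical stand-in for a real LLM call, and majority voting is only one common aggregation choice, since the abstract text above is cut off before it names a specific one. It is not the paper's method for latent reasoning models.

```python
import random
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def sample_answer(question: str, seed: int) -> str:
    """Hypothetical stand-in for one sampled chain-of-thought.

    A real system would call an LLM with a nonzero temperature and parse
    the final answer; here we simulate a noisy solver with a seeded RNG.
    """
    rng = random.Random(seed)
    # Assumed toy behaviour: answer "42" about 60% of the time, else a random digit.
    return "42" if rng.random() < 0.6 else str(rng.randint(0, 9))


def majority_vote(answers: list[str]) -> str:
    """Aggregate sampled answers by picking the most frequent one."""
    return Counter(answers).most_common(1)[0][0]


def parallel_tts(question: str, n_samples: int = 16) -> tuple[str, list[str]]:
    """Draw n_samples answers in parallel and aggregate them by majority vote."""
    with ThreadPoolExecutor() as pool:
        answers = list(
            pool.map(lambda seed: sample_answer(question, seed), range(n_samples))
        )
    return majority_vote(answers), answers


if __name__ == "__main__":
    final, samples = parallel_tts("What is 6 * 7?", n_samples=16)
    print("samples:", samples)
    print("aggregated answer:", final)
```

Raising `n_samples` is the scaling knob in this sketch: more parallel samples give the vote more evidence, at a proportional increase in inference compute.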