Understanding Performance Gap Between Parallel and Sequential Sampling in Large Reasoning Models
arXiv:2604.05868v1 Announce Type: new
Abstract: Large Reasoning Models (LRMs) have shown remarkable performance on challenging questions, such as math and coding. However, to obtain a high quality solution, one may need to sample more than once. In pr…