Discovering Learning-Friendly Generation Orders for Sequential Computation
arXiv:2506.23875v4 Announce Type: replace
Abstract: Sequential computation via autoregressive generation can make difficult tasks learnable, but the generation order of intermediate states strongly affects whether training succeeds. We address the pro…