Compute Aligned Training: Optimizing for Test Time Inference
arXiv:2604.24957v1 Announce Type: new
Abstract: Scaling test-time compute has emerged as a powerful mechanism for enhancing Large Language Model (LLM) performance. However, standard post-training paradigms, Supervised Fine-Tuning (SFT) and Reinforceme…