Query-Conditioned Test-Time Self-Training for Large Language Models
arXiv:2605.13369v2 Announce Type: replace-cross
Abstract: Large language models (LLMs) are typically deployed with fixed parameters, and their performance is often improved by allocating more computation at inference time. While such test-time scaling…