One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement
arXiv:2604.25444v1 Announce Type: new
Abstract: Large Language Models (LLMs) often fail to utilize their latent reasoning capabilities due to a distributional mismatch between ambiguous human inquiries and the structured logic required for machine act…