Dynamic Mixed-Precision Routing for Efficient Multi-step LLM Interaction
arXiv:2602.02711v2 Announce Type: replace
Abstract: Large language models (LLMs) achieve strong performance in long-horizon decision-making tasks through multi-step interaction and reasoning at test time. While practitioners commonly believe a higher …