Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning
arXiv:2604.23623v1 Announce Type: new
Abstract: Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final answer…