cs.AI, cs.CL, cs.LG

When Less is Enough: Efficient Inference via Collaborative Reasoning

arXiv:2605.01111v1 Announce Type: cross
Abstract: In this work, we introduce DUET (Dual-model Efficient Two-stage inference), a collaborative inference framework in which a capable model and a lightweight model work together to solve a task. Relying o…