When Less is Enough: Efficient Inference via Collaborative Reasoning
arXiv:2605.01111v1 Announce Type: cross
Abstract: We introduce DUET (Dual-model Efficient Two-stage inference), a collaborative inference framework in which a capable model and a lightweight model work together to solve a task. Relying o…