cs.AI, cs.DC

SMART: When is it Actually Worth Expanding a Speculative Tree?

arXiv:2604.09731v1 Announce Type: cross
Abstract: Tree-based speculative decoding accelerates autoregressive generation by verifying a branching tree of draft tokens in a single target-model forward pass. However, existing methods prioritize maximizin…