cs.AI, cs.CL, cs.LG

Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs

arXiv:2605.15077v1 Announce Type: cross
Abstract: Function calling, also known as tool use, is a core capability of modern LLM agents but is typically constrained by synchronous execution semantics. Under these semantics, LLM decoding is blocked until…