VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?
arXiv:2605.06068v1 Announce Type: new
Abstract: For years, we have built LLM serving systems like any other critical infrastructure: a single general-purpose stack, hand-tuned over many engineer-years, meant to support every model and workload. In thi…