cs.CL, cs.LG

Reward-Based Online LLM Routing via NeuralUCB

arXiv:2603.30035v1 Announce Type: cross
Abstract: This study investigates the use of NeuralUCB for cost-aware large language model (LLM) routing. Existing routing approaches can be broadly grouped into supervised routing methods and partial-feedback m…