cs.CL, cs.LG, cs.NE

AP-BMM: Approximating Capability-Efficiency Pareto Sets of LLMs via Asynchronous Prior-guided Bayesian Model Merging

arXiv:2512.09972v5 Announce Type: replace-cross
Abstract: Navigating the capability–efficiency trade-off in Large Language Models (LLMs) requires approximating a high-quality Pareto set. Existing model merging research has focused predominantly on co…