cs.LG, stat.ML

Minimizing Human Intervention in Online Classification

arXiv:2510.23557v2 Announce Type: replace
Abstract: Training or fine-tuning large language model (LLM)-based systems often requires costly human feedback, yet there is limited understanding of how to minimize such intervention while maintaining strong…