Minimizing Human Intervention in Online Classification
arXiv:2510.23557v2 Announce Type: replace
Abstract: Training or fine-tuning large language model (LLM)-based systems often requires costly human feedback, yet there is limited understanding of how to minimize such intervention while maintaining strong…