cs.AI, cs.LG

Disposition Distillation at Small Scale: A Three-Arc Negative Result

arXiv:2604.11867v1 Announce Type: cross
Abstract: We set out to train behavioral dispositions (self-verification, uncertainty acknowledgment, feedback integration) into small language models (0.6B to 2.3B effective parameters) through a four-stage all…