Disposition Distillation at Small Scale: A Three-Arc Negative Result
arXiv:2604.11867v1 Announce Type: cross
Abstract: We set out to train behavioral dispositions (self-verification, uncertainty acknowledgment, feedback integration) into small language models (0.6B to 2.3B effective parameters) through a four-stage all…