cs.AI, cs.HC

Alignment has a Fantasia Problem

arXiv:2604.21827v1 Announce Type: new
Abstract: Modern AI assistants are trained to follow instructions, implicitly assuming that users can clearly articulate their goals and the kind of assistance they need. Decades of behavioral research, however, s…