Conformity Generates Collective Misalignment in AI Agents Societies
arXiv:2605.10721v1 Announce Type: cross
Abstract: Artificial intelligence safety research focuses on aligning individual language models with human values, yet deployed AI systems increasingly operate as interacting populations where social influence …