cs.CL

Jailbreaking Large Language Models with Morality Attacks

arXiv:2604.17053v1 Announce Type: new
Abstract: Pluralism alignment with AI has the sophisticated and necessary goal of creating AI that can coexist with and serve morally multifaceted humanity. Research towards pluralism alignment has many efforts in…