Ying Su, Mingen Zheng, Weili Diao, Haoran Li

Jailbreaking Large Language Models with Morality Attacks

Ying Su, Mingen Zheng, Weili Diao, Haoran Li / April 21, 2026

arXiv:2604.17053v1 Announce Type: new
Abstract: Pluralism alignment with AI has the sophisticated and necessary goal of creating AI that can coexist with and serve morally multifaceted humanity. Research towards pluralism alignment has many efforts in…

Author name: Ying Su, Mingen Zheng, Weili Diao, Haoran Li

Jailbreaking Large Language Models with Morality Attacks