cs.AI, cs.CL, cs.HC

Inertia in Moral and Value Judgments of Large Language Models

arXiv:2408.09049v3 Announce Type: replace
Abstract: Large Language Models (LLMs) behave non-deterministically, and prompting has become a common method for steering their outputs. A popular strategy is to assign a persona to the model to produce more …