cs.AI, cs.CL, cs.LG

Negation Neglect: When models fail to learn negations in training

arXiv:2605.13829v1 Announce Type: cross
Abstract: We introduce Negation Neglect, where finetuning LLMs on documents that flag a claim as false makes them believe the claim is true. For example, models are finetuned on documents that convey “Ed Sheeran…