When Prompts Interact: Assessing Prompt Arithmetic for Deconfounding under Distribution Shift
arXiv:2605.03096v1 Announce Type: new
Abstract: In classification tasks, models may rely on confounding variables to achieve strong in-distribution performance, capturing spurious features that fail under distribution shift. This shortcut behavior lea…