When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making
arXiv:2602.04003v3 Announce Type: replace
Abstract: Most adversarial threats in artificial intelligence (AI) target the computational behavior of models rather than the humans who rely on them. Yet modern AI systems increasingly operate within human d…