cs.AI, physics.soc-ph

Fusion-fission forecasts when AI will shift to undesirable behavior

arXiv:2605.14218v1 Announce Type: new
Abstract: The key problem with using ChatGPT-like AI across society is that its behavior can shift, unnoticed, from desirable to undesirable: encouraging self-harm, extremist acts, financial losses, or costly …