| As agent deployments move from demos to production, the failure modes are becoming real — agents taking unintended actions, leaking PII, running loops that cause damage before anyone notices. We have been researching runtime behavioral monitoring for AI agents and built a system that scores risk across five dimensions in real time: action type, resource sensitivity, blast radius, frequency, and context deviation. Happy to discuss the threat model and scoring approach — curious what failure modes others have encountered deploying agents in production. [link] [comments] |