AI agents are now part of our automation layer, calling tools, modifying systems, and influencing production workflows. Unlike traditional automation, their behavior can be unpredictable under pressure. This talk explores how we stress-test AI agents embedded in production systems using red teaming techniques to surface hidden failure modes and operational risks before they escalate into customer-facing incidents.
