Don't Panic!




How to Cope Now You’re Responsible for Production

More and more developers are expected to be on-call, provide out-of-hours support, and respond to production outages. If you don’t have much experience handling incidents, that can be scary and intimidating, and can feel like being dropped in at the deep end. But it doesn’t have to be that way!

Over two years on the FT’s Content team, we’ve transformed our incident response – from a series of mildly terrifying multi-hour outages to a stable platform where team members feel comfortable on-call.

This talk will provide practical tips and advice on:

  • setting up an incident response framework
  • what to do when Everything Is On Fire™
  • improving things afterwards
  • and some horror stories of our own…

Speaker

Euan Finlay

 
Euan currently works across multiple teams at the Financial Times, helping to support Java & Go microservices, Docker containers in Kubernetes, and the website as a whole. As someone on the ...