Note: Meeting Room 7 will be available as an On-Call Room for attendees.

Thursday, August 31 • 10:50 - 11:45
Case Study: Lessons Learned from Our First Worldwide Outage

Last year, on March 10, Incapsula experienced the first worldwide outage in its history… While relatively short in duration, it affected thousands of websites that rely on our security and acceleration every day.

Rooted in a three-year-old dormant bug in our IncapRules code, this outage made us realize that we needed to change the way we write and qualify code. As VP of Engineering, I am responsible for the faulty code and our testing procedures, and it was up to me to lead the team to an order-of-magnitude improvement in reliability.

One of the key things we were missing was a way to propagate customer configuration across our network quickly without compromising safety. The result was a new configuration sandbox system that achieves both.

In this talk I’ll present the process we followed to analyze the true reliability of our system and the framework we use to reason about it, prioritize tasks across teams, and design a more reliable service.

Speakers
Yoav Cohen

Imperva Incapsula
Yoav is VP of Engineering for Imperva Incapsula, and has been with the company since they made their first sale. In between meetings you will find him working on build systems or nasty performance bugs. When not doing so, he tries to sneak a few minutes on his guitar or doing laps...


Thursday August 31, 2017 10:50 - 11:45 IST
Pembroke Room