Incident Management Simulation Day with the SRE and Monitor:Health teams

Overview

The Monitor:Health team owns Category:Alert Management (planned) and Category:Incident Management (viable). Internal customers for these categories include members of the SRE team. As we work to mature these categories into products that will be loved in the market, one of the first steps is to build something that our internal customers can dogfood.

Alert Management and Incident Management involve critical workflows that must be highly reliable. In other words, before the SRE team can begin using GitLab to triage alerts and respond to incidents, we need to add more important functionality and demonstrate that GitLab can be used reliably.