However you define it, the Site Reliability Engineer (SRE) role is spreading to more and more companies. To be effective in this new role, SREs need a deep understanding of how different systems work together, how they fail, how they can be improved, and how best to design and monitor them.

The code’s been written. It’s been reviewed. It’s been tested. The build has passed and it’s finally time to deploy, but will this thing actually work in production? Every team aspires to accelerate their development cycle, but increasing velocity while maintaining quality is hard.