ONLINE: Site Reliability Engineering Part 2
Site Reliability Engineering (SRE) is a ‘SRE-ious’ movement that breaks down the traditional barriers and ends the age-old battles between Development & Operations teams. SRE essentially creates a hybrid role that tries to maintain an equilibrium between developing new features and running production systems reliably.
This is a CMG member only event, sign in or join today to register.
This is a 3 part webinar series that attempts to explore SRE from its origins, an insight into its specific terminology to its current evolution as a standard to maintain or run production systems. Each part either tries to answer a series of questions or explores a set of inter-related topics – in an attempt to go in-depth and provide an overall picture
Part 2 – SRE – Defining critical practices based on Service Reliability Hierarchy ? This part goes into various elements that essentially makes a service reliable. These elements include:
- Incident Response
- Postmortem / Root Cause Analysis
- Testing + Release Procedures
- Capacity Planning