SRE Principles
Basic concepts of SRE based on Google SRE book, relationship between development and operations, and SRE tenets
Error Budget
Concept of Error Budget, calculation method, and YouTube case study
Postmortem (Incident Analysis)
Purpose, implementation methods, templates, and best practices for postmortems in SRE
Toil Definition and Management
Explains the definition of toil in SRE, why reducing it is important, and the difference between toil and engineering work.
SRE and Observability
Definition of Observability, differences from Monitoring, Three Pillars, and benefits for SRE based on IBM technical documentation.