Site Reliability Engineering Principles Overview – c110002gwpl
Course #: c110002gwpl
Duration: 0.8 Hours
This module covers the basic principles of Site Reliability Engineering (SRE). It also covers the driving forces behind the practice, the risks that affect service owners, and a method to mitigate those risks and improve customer satisfaction. In addition, it introduces the concept of using an error budget to control code release velocity.
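As a rough illustration of the error budget idea, the sketch below assumes the commonly used definition, in which the budget is the unreliability an availability target permits (1 minus the target) over a fixed window. The function names, the 30-day window, and the simple release gate are hypothetical choices made for this example; the module itself may define and apply the budget differently.

```python
def error_budget_minutes(slo: float, window_days: int = 30) -> float:
    """Return the downtime (in minutes) that an availability target allows.

    Assumption: the budget is (1 - slo) of the total time in the window.
    """
    if not 0.0 < slo < 1.0:
        raise ValueError("slo must be a fraction between 0 and 1, e.g. 0.999")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo) * total_minutes


def can_release(slo: float, downtime_minutes: float, window_days: int = 30) -> bool:
    """Hypothetical release gate: allow releases only while budget remains."""
    return downtime_minutes < error_budget_minutes(slo, window_days)


if __name__ == "__main__":
    budget = error_budget_minutes(0.999)
    # A 99.9% target over 30 days leaves roughly 43.2 minutes of budget.
    print(f"Budget: {budget:.1f} minutes")
    print("Release allowed after 10 min of downtime:", can_release(0.999, 10.0))
    print("Release allowed after 50 min of downtime:", can_release(0.999, 50.0))
```

Under these assumptions, a service that has already spent more downtime than its budget would pause releases until the window recovers, which is how the budget acts as a control on release velocity.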
Objectives
- Gain awareness of the driving forces behind Site Reliability Engineering, including velocity, reliability, and availability.
- Understand how to manage risks to the happiness of users.
- Learn about the concept of an error budget.
Audience
This course is intended for learners who are pursuing professional-level site reliability engineer certification on IBM Cloud.
Prerequisites
Before starting this curriculum, the target audience should understand:
• System Thinking
• DevOps practices
• Cloud Architecture
• Software engineering principles
• System administration
• Network and OSI model
• Networking and security practices for IBM Cloud
• Incident management
• Root cause analysis
The target audience should also be able to:
• Proficiently write code
• Create runbooks as a reference
• Make system components serviceable
• Interpret data and statistics to determine actions
• Use LogDNA, Sysdig, Grafana, Prometheus, and Kibana
• Interpret schematics
• Drive incidents to resolution
• Remediate underlying sources of unreliability
• Create and configure VMs
• Create and configure Containers on IBM Kubernetes Service (IKS)/Red Hat OpenShift Kubernetes Services (ROKS)
• Create and configure Containers using OpenShift
• Create and configure Serverless applications
• Configure for high availability and scalability
Topics
Module Introduction
Topic 1: Driving Forces Behind SRE Including Velocity, Reliability, and Availability
Topic 2: Risks to the Happiness of Users
Topic 3: Error Budget
Module Summary