Understanding the Importance of Reliability and Resiliency for Services – c110042gwpl

Course #: c110042gwpl

Duration: 0.8 Hours

The Understanding the Importance of Reliability and Resiliency for Services module explains why reliability and resiliency are major considerations when building a system or service. It also covers how to incorporate these considerations into the build plan using key architectural patterns.


  • Understand reliability concepts
  • Understand resiliency concepts
  • Learn how to use IBM Cloud Regions and Availability Zones
  • Build a Reliable Service on IBM Cloud


This course is intended for learners who are pursuing professional-level site reliability engineer certification on IBM Cloud.


Before starting this curriculum, the target audience should understand:
  • System Thinking
  • DevOps practices
  • Cloud Architecture
  • Software engineering principles
  • System administration
  • Network and OSI model
  • Networking and security practices for IBM Cloud
  • Incident management
  • Root cause analysis

The target audience should also be able to:
  • Proficiently write code
  • Create run books as a reference
  • Make system components serviceable
  • Interpret data and statistics to determine actions
  • Use LogDNA, Sysdig, Grafana, Prometheus, and Kibana
  • Interpret schematics
  • Drive incidents to resolution
  • Remediate underlying sources of unreliability
  • Create and configure VMs
  • Create and configure Containers on IBM Kubernetes Service (IKS)/Red Hat OpenShift Kubernetes Services (ROKS)
  • Create and configure Containers using OpenShift
  • Create and configure Serverless applications
  • Configure for high availability and scalability


Module Introduction
Topic 1: Reliability Overview
Topic 2: Resiliency Overview
Topic 3: IBM Cloud Region and IBM Cloud Availability Zone
Topic 4: Build a Reliable Service on IBM Cloud
Module Summary

