Menu

COMING SOON – Creating and Maintaining Metrics, Traces, and Alerts on IBM Cloud – c110015gwpl

Course #: c110015gwpl

Duration: 0 Days

This module enables you to collect, analyze, and manage logs on IBM Cloud. The module also covers using IBM Log Analysis with LogDNA to configure log sources, view logs, and work with alerts.

Objectives

  • Identify key metrics for service availability
  • Describe how to select tools to help identify service issues
  • Identify patterns to trace root cause
  • Identify situations that require alerts and sources of alerts

Audience

This course is intended for learners who are pursuing professional-level site reliability engineer certification on IBM Cloud.

Prerequisites

Before starting this curriculum, the target audience should understand:
•System Thinking
•DevOps practices
•Cloud Architecture
•Software engineering principles
•System administration
•Network and OSI model
•Networking and security practices for IBM Cloud
•Incident management
•Root cause analysis

The target audience should also be able to:
•Proficiently write code
•Create run books as a reference
•Make system components serviceable
•Interpret data and statistics to determine actions
•Use LogDNA, SysDig, Grafana, Prometheus, Kibana
•Interpret schematics
•Drive incidents to resolution
•Remediate underlying sources of unreliability
•Create and configure VMs
•Create and configure Containers on IBM Kubernetes Service (IKS)/Red Hat OpenShift Kubernetes Services (ROKS)
•Create and configure Containers using OpenShift
•Create and configure Serverless applications
•Configure for high availability and scalability

Topics

Module Introduction

Topic 1: Key Metrics for Service Availability

Topic 2: Tools Used for Identifying Service Issues

Topic 3: Patterns to Trace Service Issue Root Causes

Topic 4: Situations That Require Alerts

Module Summary

Contact us regarding the training