Collecting, Analyzing, and Managing Logs on IBM Cloud – c110016gwpl
Course #: c110016gwpl
Duration: 0 Days
This module describes how to collect, analyze, and manage event logs on IBM Cloud. It discusses the implementation of visual references of patterns and key patterns.
Objectives
- Learn to configure logging on IBM Cloud
- Learn to identify patterns that indicate potential service health issues
- Learn to identify thresholds that should alert SREs to potential service health problems
- Learn to implement dashboards that visualize service health
Audience
This course is intended for learners who are pursuing professional-level site reliability engineer certification on IBM Cloud.
Prerequisites
Before starting this curriculum, the target audience should understand:
•System Thinking
•DevOps practices
•Cloud Architecture
•Software engineering principles
•System administration
•Network and OSI model
•Networking and security practices for IBM Cloud
•Incident management
•Root cause analysis
The target audience should also be able to:
•Proficiently write code
•Create run books as a reference
•Make system components serviceable
•Interpret data and statistics to determine actions
•Use LogDNA, SysDig, Grafana, Prometheus, Kibana
•Interpret schematics
•Drive incidents to resolution
•Remediate underlying sources of unreliability
•Create and configure VMs
•Create and configure Containers on IBM Kubernetes Service (IKS)/Red Hat OpenShift Kubernetes Services (ROKS)
•Create and configure Containers using OpenShift
•Create and configure Serverless applications
•Configure for high availability and scalability
Topics
Module Introduction
Topic 1: Configuring the Logging Tool
Topic 2: Identifying Key Patterns That Highlight Service Health
Topic 3: Implementing Visual References of These Patterns (Dashboards, Views)
Topic 4: Identifying Key Patterns to Alert SREs to Potential Problems (Thresholds)
Module Summary