Professional Training

SRE Fundamentals and Security

edX, Online
Length
5 weeks
Next course start
Start anytime See details
Course delivery
Self-Paced Online
Length
5 weeks
Next course start
Start anytime See details
Course delivery
Self-Paced Online
Visit this course's homepage on the provider's site to learn more or book!

Course description

SRE Fundamentals and Security

Site Reliability Engineers must have the right tools and strategies to perform in a technical, fast-paced environment. IBM Cloud SRE is guided by nine competency areas that lead to the successful practice of the discipline:

  • Applying Site Reliability Engineering principles
  • Operations
  • Monitoring and incident management
  • Security and compliance
  • Compute infrastructure
  • Networking
  • Storage and data management
  • Reliability and resiliency
  • Deployment automation

In this first course of the three-part Professional Certificate in Site Reliability Engineering (SRE), you will focus on the first four SRE competencies:

  • Applying Site Reliability Engineering principles
  • Operations
  • Monitoring and incident management
  • Security and compliance

Upcoming start dates

1 start date available

Start anytime

  • Self-Paced Online
  • Online
  • English

Suitability - Who should attend?

Prerequisites

At least 1 year experience in SRE or technology.

Understanding of:

  • DevOps practices
  • Software engineering principles
  • System administration
  • Network and OSI model
  • Incident management
  • Root cause analysis

Training Course Content

Welcome and Introduction

You will cover the following topics:

An introduction to the IBM Professional SRE role

SRE Fundamentals and Terminology

You will cover the following topics:

  • Deeper dive into SRE role
  • SRE principles
  • Managing trade-offs between change, velocity, and reliability
  • Negotiating service level objectives, service level indicators, error budgets and the user experience
  • IBM Cloud tools and technology across the Software Development Life Cycle
  • Applying software engineering principles to drive reliability

Operations

You will cover the following topics:

  • Performing operational readiness reviews (ORR) on IBM Cloud
  • Creating ORR checklist
  • Employing cost-optimization strategies
  • Managing backups and recoveries on IBM Cloud

Monitoring

You will cover the following topics:

  • Monitoring overview
  • Creating and maintaining metrics, traces, and alerts on IBM Cloud
  • Collecting, analyzing, and managing logs on IBM Cloud
  • Identifying key metrics for service health on IBM Cloud
  • Using performance and availability metrics to measure the health of services on IBM Cloud

Incident Management

You will cover the following topics:

  • Managing incidents on IBM Cloud
  • Developing a balanced action plan to mitigate future incidents
  • Performing the post-incident review

Security and Compliance

You will cover the following topics:

  • Monitoring and managing security threats on IBM Cloud
  • Implementing and managing security policies on IBM Cloud
  • Implementing encryption models
  • Managing role-based access control on IBM Cloud

Course delivery details

This course is offered through IBM, a partner institute of EdX.

2-3 hours per week

Expenses

  • Verified Track -$99
  • Audit Track - Free
Ads