DOI-10008
Learn how to leverage the advanced techniques and tools to enable scaling, resiliency, responsive, and agility to your organization.
Site Reliability Engineers focus on bringing speed and agility by automating toils, and reduce failures, and optimize ticket responsiveness and adopting SRE into well structured organizations.
Overivew
Organizations always facing challenges of scale, while we accept failures as norm, still the challenge to scale and ensure reliability with agility.
With Site Reliability Engineering Foundation course offered by PeopleCert for Exam Certification is a creditable 16 hours of approved certification requirements to prepare students to pass the exam.
Exam vouchers are offered with this course, and booked and managed by PeopleCert.
This course, covers the concepts and the fundamentals of Site Reliability Engineering Foundation in real world challenges.
Prerequisites
Knowledge
Students to this class are expected to have:
- Basic knowledge of sites operations
- Basic understanding of computer operations skills :such as managing files
Technology
Depending on the delivery method of this course, the students should have :
- A Workstation with Internet browser capability such as (Chrome, Edge, or Safari)
- Good persistent internet connection without blocking firewalls(ideally non corporate firewall protected workstations)
The Workshops
Workshops is a collaborative effort so students can demonstrate their ability to lead and contribute within a team with various skills.
Workshops covered in this course:
- Workshop 1: Analyzing and Detecting Toil
- Workshop 2: Develop SLO, SLI and Observability
Objectives
Students who completed this course, should build the skills and knowledge that allows them to
Audience
This course is designed to assist and equip the students with the skills and knowledge that allows them to perfect their daily tasks with respect to operationalize the CI/CD pipelines with confidence and capitalize the organization investment to operate reliable business.
- Scrum Masters: Understand and publish meaningful observability metrics to team’s performance dashboards
- Product Owners: understand the business value of time to respond to incidents and to ensure business continuity and reliability metrics at bar.
- Application Architects: Understand the impact and importance of SLOs and SLIs and how to architect observable applications
- Software Developers: Understand the significance and relevancy of metrics and SLIs
- Security Architects: Ensure the SRE practices are secured and within the security guidelines
- SRE Site Reliability Engineers: Learn skills and knowledge to develop realistic SLOs, and SLIs, Discover and optimize TOIL within the CI/CD pipeline
- Systems Architects: Build infrastructure environments within SRE and Security practices
- Help Desk staff: Understand the SLO and ticket response mechanism and optimize toil
Timeline
The Site Reliability Engineering-SRE Foundation certification Course is a 2 days course, includes lectures, demos, and workshops.
The following is guidelines for the instructor to organize the time pace with the students, subject to change based on students preference.
Breaks during the day follows the 106 rule, every 45-60m
*the 106 rule, indicates the human memory capacity to learn the new factual elements which is 106 facts before the memory could be reused.








Course Curriculum
SRE Principles & Practices
- What is Site Reliability Engineering?
- SRE & DevOps – What is the Difference?
- SRE Practices
Service Level Objectives & Error Budgets
- Service Level Objectives
- Error Budgets
- Error Budget Policies

Reducing Toil
- What is Toil?
- Why toil is bad?
- Doing something about toil
Monitoring & Service Level Indicators
- SLI’s – Service Level Indicators
- Monitoring
- Observability

SRE Tools & Automation
- Automation Defined
- Automation Focus
- Hierarchy of Automation Types
- Secure Automation
- Automation Tools
Anti-Fragility & Learning from Failure
- Why learn from failure
- Benefits of anti-fragility
- Shifting the Organizational balance
Organizational Impact of SRE
- Why organizations embrace SRE
- Pattern for SRE adoption
- SRE Job Description
- Sustainable Incident Response
- Blameless post mortems
- SRE & Scale
SRE, Other Frameworks, The Future
- SRE & Other Frameworks
- SRE Evolution
Calendar
Scroll through the months, and chose the right schedule for you, send us a standard request form register