Overivew
Organizations always facing challenges of scale, while we accept failures as norm, still the challenge to scale and ensure reliability with agility.
With Site Reliability Engineering Foundation course offered by PeopleCert for Exam Certification is a creditable 16 hours of approved certification requirements to prepare students to pass the exam.Â
Exam vouchers are offered with this course, and booked and managed by PeopleCert.
This course, covers the concepts and the fundamentals of Site Reliability Engineering Foundation in real world challenges.
Prerequisites
Knowledge
Students to this class are expected to have:
- Basic knowledge of sites operations
- Basic understanding of computer operations skills :such as managing files
Technology
Depending on the delivery method of this course, the students should have :
- A Workstation with Internet browser capability such as (Chrome, Edge, or Safari)
- Good persistent internet connection without blocking firewalls(ideally non corporate firewall protected workstations)
The Workshops
Workshops is a collaborative effort so students can demonstrate their ability to lead and contribute within a team with various skills.
Workshops covered in this course:
- Workshop 1: Day 1 post-class assignmentÂ
- Workshop 2: Day 2 post-class assignment
Objectives
Students who completed this course, should build the skills and knowledge that allows them to
Audience
This course is designed to assist and equip the students with the skills and knowledge that allows them to perfect their daily tasks with respect to operationalize the CI/CD pipelines with confidence and capitalize the organization investment to operate reliable business.
- Scrum Masters: Understand and publish meaningful observability metrics to team’s performance dashboards
- Product Owners: understand the business value of time to respond to incidents and to ensure business continuity and reliability metrics at bar.
- Application Architects: Understand the impact and importance of SLOs and SLIs and how to architect observable applications
- Software Developers: Understand the significance and relevancy of metrics and SLIs
- Security Architects: Ensure the SRE practices are secured and within the security guidelines
- SRE Site Reliability Engineers: Learn skills and knowledge to develop realistic SLOs, and SLIs, Discover and optimize TOIL within the CI/CD pipeline
- Systems Architects: Build infrastructure environments within SRE and Security practices
- Help Desk staff: Understand the SLO and ticket response mechanism and optimize toil
Timeline
The Site Reliability Engineering-SRE Practitioner certification Course is a 3 days course, includes lectures, demos, and workshops.
The following is guidelines for the instructor to organize the time pace with the students, subject to change based on students preference.
Breaks during the day follows the 106 rule, every 45-60mÂ
*the 106 rule, indicates the human memory capacity to learn the new factual elements which is 106 facts before the memory could be reused.








Course Curriculum
Module 1: SRE Anti-Patterns
- DevOps and SRE blueprint
- SRE in distributed ecosystems
- SRE Barriers
- SRE Anti-Patterns
- Monzo bank Case study
Module 2: SLO is a Proxy for Customer Happiness
- SLO – Service Level Objectives
- System boundaries for SLI
- Error Budget between velocity and stability

Module 3: Building Secure and Reliable Systems
- Build Secure and Reliable Systems
- NALSD Non-Abstract Large Scale Design
- Designing for the changing Architecture and Distributed ecosystems
- Fault tolerance Design
- Designing for Security
- Designing for Resiliency
Module 4: Full Stack Observability
- Modern Apps are complex and unpredictable
- Slow is the New Down
- Pillars of Observability
- Using Open Telemetry

Module 5: AIOps
- AIOps and ITOps
- Importance of DataOps
- AIOps Maturity
- Secure Automation
- Automation Tools
Module 6: SRE and Incident Response Management
- SRE Key responsibilities towards incident response
- DevOps and SRE and ITSM
- OODA and SRE Incident Response
- SRE and CLR (Closed Loop Remediation)
- Swarming – Food for Thought
- AI/ML for better Incident Management

Module 7: Chaos Engineering
- Navigating Complexity
- Chaos Engineering Defined
- Quick Facts
- Chaos Monkey Origin Story
- Who is adapting Chaos Engineering
- Myths of Chaos
- GameDay Exercises
- Security Chaos Engineering
- Chaos Engineering Resources
Module 8: SRE is the Purest Form of DevOps
- Key Principles of SRE
- SREs help increase Reliability across the spectrum
- Metrics for Success
- SRE Execution models
- Culture and Behavioral Skills are key
- Transformation after implementing SRE practices
Calendar
Scroll through the months, and chose the right schedule for you, send us a standard request form register