SRE: Resiliency and Automation
Learn how SRE teams build resilient systems through architectural patterns, automation, and operational excellence. Transform unreliable applications into production-ready services that scale efficiently and recover gracefully from failures.
What you'll learn
Modern Site Reliability Engineering (SRE) teams face a key challenge: ensuring operational excellence without being buried in manual work. In this course, SRE: Resiliency and Automation, you’ll gain the skills needed to transform complex, fragile applications into resilient, self-healing systems. First, you’ll explore how to architect for resilience using distributed systems patterns like caching and asynchronous messaging, and how to validate robustness with chaos engineering. Next, you’ll discover how to automate deployments and operations through GitOps pipelines, Infrastructure as Code, and self-healing strategies that reduce manual toil. Finally, you’ll learn how to right-size infrastructure using autoscaling and capacity planning, and how to prepare for outages with disaster recovery techniques. When you’re finished with this course, you’ll have the skills and knowledge of production-grade SRE practices that you need to build scalable, reliable systems that operate smoothly around the clock.
About the author
Elton is an independent consultant specializing in systems integration with the Microsoft stack. He is a Microsoft MVP, blogger, and practicing Technical Architect.