Simple play icon Course
Skills

Incorporating Site Reliability Engineering (SRE) in Your System Design

by Elton Stoneman

SRE is the hot way to manage apps in production - but are you and your systems ready for it? This course teaches you how to design systems for maximum reliability, find the gaps in your current system design and adopt SRE smoothly and effectively.

What you'll learn

Before you adopt SRE you need to be sure that your systems are designed to work well with SRE practices. In this course, Incorporating Site Reliability Engineering (SRE) in Your System Design, you’ll learn how to design systems with SRE in mind and assess what's missing in your existing systems. First, you’ll discover how to architect apps for reliability, so temporary problems are automatically managed and bigger issues are quickly alerted. Next, you’ll explore how observability design supports SRE and helps you get your apps back online. Finally, you’ll delve into how to effectively measure and report on service levels. When you’re finished with this course, you’ll have the skills and knowledge of system design needed to bring your own apps into SRE.

About the author

Elton is a 10-time Microsoft MVP, author, trainer and speaker. He spent most of his career as a consultant working in Microsoft technologies, architecting and delivering complex solutions for industry leaders. He has delivered APIs on Azure serving millions of clients daily, Big Data solutions processing billions of events weekly, and cutting-edge solutions powered by containers. Elton's experience with .NET goes from .NET 1.0 running on Windows Server, right up to .NET Core running on Linux. Wh... more

Ready to upskill? Get started