- Course
SRE: Monitoring and Observability
Learn how to design effective monitoring and alerting systems, implement SLIs, and explore AIOps tools to enhance system reliability through automation and observability practices.
- Course
SRE: Monitoring and Observability
Learn how to design effective monitoring and alerting systems, implement SLIs, and explore AIOps tools to enhance system reliability through automation and observability practices.
Get started today
Access this course and other top-rated tech content with one of our business plans.
Try this course for free
Access this course and other top-rated tech content with one of our individual plans.
This course is included in the libraries shown below:
- Core Tech
What you'll learn
Monitoring and observability are the key tools for SRE teams to manage systems and keep them running smoothly. Collecting and analyzing data about application behavior enables SREs to find and fix issues quickly. In this course, SRE: Monitoring and Observability, you’ll learn about the components of the observability stack and the processes it enables by following an SRE team as they prepare to onboard a new system. First, you’ll learn what data apps need to expose to feed into the monitoring systems. Next, you’ll explore service level indicators to see how to measure performance. Then, you’ll walk through alerting and automated responses to issues. Finally. you’ll look at new tools in the AIOps space. When you’re finished with the course, you'll be able to define and implement an observability stack and use it to drive system reliability.