Expanded

Observability and the Paradox of Alerts

SLOs and observability are the key to a safe, sane on call rotation that is not severely life-impacting. This session will share how to get there.
Course info
Level
Intermediate
Updated
Oct 6, 2020
Duration
1h 25m
Table of contents
Observability and the Paradox of Alerts
Description
Course info
Level
Intermediate
Updated
Oct 6, 2020
Duration
1h 25m
Description

How many different paging alerts wake up your team from how many different systems? Many teams have too many paging alerts for them to usefully manage today, and most teams are hurtling toward an unsustainable future. It is a paradox of scale: The bigger and more complicated your systems get, the fewer paging alerts you should have. The good news is that most teams who move from a monitoring model to observability are able to delete about 90% of paging alerts while increasing reliability from the customer’s perspective. SLOs and observability are the key to a safe, sane on call rotation that is not severely life-impacting — a rotation your senior engineers will be proud to join, not one staffed by everyone who isn't yet influential enough to get out of it. In this session, Charity Majors will tell you how to get there.

About the author
About the author

DevSecCon is the global community dedicated to DevSecOps to help implement security in the overall development process.

More from the author
gRPC on Kubernetes
Intermediate
24m
Oct 7, 2020
When Music and Software Come Together
Intermediate
25m
Oct 7, 2020
More courses by DevSecCon