Y2K and Other Disappointing Disasters: Risk Reduction and Harm Mitigation

By Gremlin
This talk is focused on understanding where we can prevent problems and where we can just make them less bad, and what kinds of tools we can use to make every disaster a disappointing fizzle.
Course info
Level
Intermediate
Updated
Nov 6, 2020
Duration
29m
Table of contents
Y2K and Other Disappointing Disasters: Risk Reduction and Harm Mitigation
Description
Course info
Level
Intermediate
Updated
Nov 6, 2020
Duration
29m
Description

Every disaster is a concatenation of smaller failures. How can we design software and processes to accept that we live in an imperfect world? Explore the concepts of resiliency, harm reduction, over-engineering, and planning for failure with real examples. Risk Reduction is trying to make sure bad things happen as rarely as possible. It's anti-lock brakes and vaccinations and irons that turn off by themselves and all sorts of things that we think of as safety modifications in our life. We are trying to build lives where bad things happen less often. Harm Mitigation is what we do so that when bad things do happen, they are less catastrophic. Building fire sprinklers and seatbelts and needle exchanges are all about making the consequences of something bad less terrible. This talk is focused on understanding where we can prevent problems and where we can just make them less bad, and what kinds of tools we can use to make every disaster a disappointing fizzle. Audiences will leave with a clearer understanding of risk and harm, and a set of tools than can be used to minimize future problems. I'm going to talk about why we need to understand both avoiding problems and making them less catastrophic, and what kinds of tools are appropriate to each. I think that developers need to be thinking about failure states more than we currently do. We talk about avoiding them, or testing them away, but we don't talk about how to make even failure a better experience.

About the author
About the author

Gremlin's enterprise Chaos Engineering platform makes it easy to build more reliable applications in order to prevent outages, innovate faster, and earn customer trust.

More from the author
Failing over without Falling over
Intermediate
21m
Nov 30, 2020
Automating Chaos Attacks at Expedia
Intermediate
24m
Nov 30, 2020
More courses by Gremlin