Microsoft Ignite 2019: Improving Reliability through Modern Operations Practices

Paths

Microsoft Ignite 2019: Improving Reliability through Modern Operations Practices

Author: Microsoft Ignite 2019

We return to check in on Tailwind Traders a year later after their successful migration to the cloud. A new challenge arises as they are not experiencing the reliability they... Read more

What You Will Learn

  • Azure DevOps
  • Incident Handling and Response
  • Deployment Practices

Pre-requisites

None.

Improving Reliability through Modern Operations Practices

We return to check in on Tailwind Traders a year later after their successful migration to the cloud. A new challenge arises as they are not experiencing the reliability they need. Frequent outages, piles of pager alerts, and constant manual effort is needed to keep things running. This is taking a toll on the people involved and having a pronounced negative effect on the business. Tailwind Traders realizes that their previous approaches are not working so they decide they must adopt more modern operations practices. And this is where we join them in their journey. Together, we'll learn the modern operations principles and practices (and the Azure features that support them) to help us achieve the level of reliability we need. This event covers advanced (level 300) content and includes the following technologies: Azure Monitor, Azure Log Analytics, Azure Data Explorer, Microsoft Teams, Logic Apps, Azure Service Health, Azure Workbooks, Azure Service Alerts, Github Actions, Azure Pipelines, ARM Templates, Azure Auto-Scaling, Azure Cost Analysis Tools, and Power BI.

These courses should be watched sequentially.

Building the Foundation for Modern Ops: Monitoring

by Microsoft Ignite 2019

Feb 12, 2020 / 47m

47m

Start Course
Description

You‘re concerned about the reliability of your systems, services, and products. Where should you start? In this session, get an introduction to modern operations disciplines and a framework for reliability work. We jump into monitoring: the foundational practice you must tackle before you can make any headway with reliability. Using Tailwind Traders as an example, we demonstrate how to monitor your environment, including the right (and wrong) things to monitor – and why. You’ll leave with the crucial tools and knowledge you need to discuss and improve reliability using objective data.

Table of contents
  1. Building the Foundation for Modern Ops: Monitoring

Responding to Incidents

by Microsoft Ignite 2019

Feb 12, 2020 / 49m

49m

Start Course
Description

Your systems are down. Customers are calling. Every moment counts. What do you do? Handling incidents well is core to meeting your reliability goals. In this session, explore incident management best practices - through the lens of Tailwind Traders - that will help you triage, remediate, and communicate as effectively as possible. We also walk through some of the tools Azure provides to get you back into a working state when time is of the essence.

Table of contents
  1. Responding to Incidents

Learning from Failure

by Microsoft Ignite 2019

Feb 12, 2020 / 50m

50m

Start Course
Description

Incidents will happen—there’s no doubt about that. The key question is whether you treat them as a learning opportunity to make your operations practice better or just as a loss of time, money, and reputation. In this session, dive into one of the most important topics for improving reliability: how to learn from failure. We listen into one of Tailwind Traders post-incident reviews, often called a postmortem, and use that to learn how to shape and run this process to turn a failure into something actionable. After this session, you’ll be able to build a key feedback loop in your organization that turns unplanned outages into opportunities.

Table of contents
  1. Learning from Failure

Deployment Practices for Greater Reliability

by Microsoft Ignite 2019

Feb 12, 2020 / 45m

45m

Start Course
Description

How software gets delivered to production and how infrastructure gets provisioned have a direct, material impact on your environment’s reliability. In this session, explore delivery and provisioning best practices that Tailwind Traders uses to prevent incidents before they happen. From the discussion and demos, you’ll take away blueprints for these practices, an understanding of how they can be implemented using Azure, and ideas to apply them to your own apps and organization.

Table of contents
  1. Deployment Practices for Greater Reliability