Managing Cloud Resources Using Google Stackdriver

This course covers all important aspects of Stackdriver monitoring, which works across all Google Cloud Platform resources, making it convenient to set up uptime checks, profiling, and integration with other cloud platforms, and monitoring tools.
Course info
Level
Beginner
Updated
Jan 16, 2019
Duration
1h 36m
Table of contents
Description
Course info
Level
Beginner
Updated
Jan 16, 2019
Duration
1h 36m
Description

Stackdriver Monitoring is a powerful and versatile cloud monitoring tool that is tightly integrated with virtually every service on the Google Cloud Platform. You can significantly improve the performance and design of your architecture and simplify troubleshooting if you master the nuances of Stackdriver Monitoring. In this course, Managing Cloud Resources Using Google Stackdriver, you will gain the ability to monitor your cloud resources track both system and user-defined metrics and respond to alerts using Stackdriver Monitoring. First, you will learn Stackdriver concepts such as metrics, monitored resources, workspaces, and alerting policies. In this process, we will learn how to install the Stackdriver monitoring agent, and also when that agent is and is not required. Next, you will discover how to monitor third-party applications and work with custom metrics. We will create resources to monitor as well as metrics associated with those resources, then use the Metrics Explorer to create dashboards to keep track of those metrics. You will also configure uptime checks and alerts to notify you when resource health is not satisfactory. Stackdriver supports uptime checks in HTTP, HTTPS, and TCP. The probes sent by these checks are governed by VPC firewall rules, so those must be set up correctly as well. Finally, you will explore how to create checks for the absence of metrics, set variables in alerts, and explore incidents and events and integrate with third-party tools. Specifically, you will integrate Stackdriver Monitoring with OpsGenie, which is an alerting and incident management platform. You will round out the course by programmatically working with the Stackdriver Monitoring API from within Datalab python notebooks. When you’re finished with this course, you will have the skills and knowledge of Stackdriver Monitoring needed to monitor, troubleshoot, and analyze the usage of your cloud resources.

About the author
About the author

A problem solver at heart, Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.

More from the author
Building Features from Image Data
Advanced
2h 10m
Aug 13, 2019
Designing a Machine Learning Model
Intermediate
3h 25m
Aug 13, 2019
More courses by Janani Ravi
Section Introduction Transcripts
Section Introduction Transcripts

Course Overview
Hi, my name is Janani Ravi, and welcome to this course on Managing Cloud Resources Using Google Stackdriver. A little about myself, I have a masters in electrical engineering from Stanford and have worked at companies such as Microsoft, Google, and Flipkart. At Google I was one of the first engineers working on real-time collaborative editing in Google Docs, and I hold four patents for its underlying technologies. I currently work on my own startup, Loonycorn, a studio for high-quality video content. In this course, you will gain the ability to monitor your cloud resources, track both system and user-defined metrics, and respond to alerts using Stackdriver Monitoring. First, you will learn Stackdriver concepts such as metrics, monitored resources, workspaces, and alerting policies. In this process, we will learn how to install the Stackdriver Monitoring agent and the metrics you can track with this agent. Next, you'll discover how to monitor third-party applications and work with custom metrics. We will create resources to monitor, as well as metrics associated with these resources, and then use the Metrics Explorer to create dashboards to keep track of these metrics. You will also configure uptime checks and alerts to notify you when resource help is not satisfactory. Finally, you will explore how to create checks for the absence of metrics, how to set variables and alerts, explore incidents and events, and integrate with third-party tools. Specifically, you will integrate Stackdriver Monitoring with Opsgenie, which is an alerting and incident management platform. You'll round out the course by programmatically working with Stackdriver Monitoring APIs from within Datalab Python Notebooks. When you're finished with this course, you'll have the skills and knowledge of Stackdriver Monitoring needed to monitor, troubleshoot, and analyze the usage of your cloud resources.