Troubleshooting production Kubernetes clusters can be difficult. In this course, Maintaining, Monitoring and Troubleshooting Kubernetes, you’ll learn to maintain, monitor, and troubleshoot Kubernetes clusters. First, you’ll learn core cluster maintenance tasks such as etcd backup and restore operations, upgrading an existing Kubernetes cluster, and performing worker node maintenance, and you’ll learn about high availability cluster topologies. Next, you’ll discover how to quickly find and analyze performance monitoring and logging data in your cluster. Finally, you’ll explore core tools and techniques for troubleshooting your cluster. When you’re finished with this course, you’ll have the maintenance, monitoring, and troubleshooting skills and knowledge needed to operate and manage Kubernetes clusters.
Anthony is the Founder and President of Centino Systems as well as a Pluralsight Author and a Microsoft Data Platform MVP, Linux Expert, and Corporate Problem Solver. Anthony designs solutions, deploys the technology, and provides expertise on business system performance, architecture, and security. Anthony has bachelor's and master's degrees in Computer Science, with research publications in high-performance, low-latency data access algorithms and spatial database systems.
Course Overview
Hi everyone, my name is Anthony Nocentino, enterprise architect and founder of Centino Systems. Welcome to my course, Maintaining, Monitoring, and Troubleshooting Kubernetes. Are you a systems administrator that needs to maintain Kubernetes clusters? If so, then this is the course for you.
In this course, you will learn core Kubernetes cluster maintenance tasks. We'll start off with etcd and learn backup and restore operations. You will then learn how to upgrade a cluster, how to perform maintenance on worker nodes, and high availability cluster topologies. Next, we'll look at how monitoring and logging enable you to understand what's happening inside your cluster. We'll look at the Kubernetes logging architecture, learning where logs are stored for the control plane, nodes, pods, and containers. We'll learn how to access and review all of those logs. You'll also learn how to access key performance metrics for pods and nodes in your cluster. And finally, you will learn the tools and techniques needed to troubleshoot a Kubernetes cluster, focusing on the control plane and worker nodes. I'll walk you through several key failure scenarios for both worker and control plane nodes, and we'll find and fix those issues together. By the end of this course, you'll have what it takes to maintain, monitor, and troubleshoot Kubernetes clusters.
Before beginning this course, you should be familiar with the Linux operating system and administering at the command line. You should have a firm understanding of TCP/IP‑based networking and also understand the fundamental concepts of containers. You will need to know the core foundations of Kubernetes, like what a cluster is and how to interact with your cluster at the command line.
I hope you'll join me on this journey to learn how to maintain, monitor, and troubleshoot your Kubernetes cluster in Maintaining, Monitoring, and Troubleshooting Kubernetes, here at Pluralsight.