Deploying Hadoop with Cloudera CDH to AWS

Learn how to deploy, size, and scale Hadoop in the cloud (namely AWS). You'll understand key concepts to deploy a CDH cluster, perform a manual installation, and finally learn how to automate deployments for multiple clusters with Cloudera Director.
Course info
Level
Intermediate
Updated
Oct 16, 2017
Duration
3h 30m
Table of contents
Course Overview
Why Hadoop in the Cloud? The Case for CDH on AWS
Understanding the Cloud: An AWS Mini Crash Course
More "AWS Mini Crash Course"
Planning Your Hadoop Cluster on AWS
Deploying, Sizing, and Scaling Your CDH Cluster on AWS
Automating Deployments & Managing Clusters with Cloudera Director
Cloudera Altus and Final Takeaway
Description
Course info
Level
Intermediate
Updated
Oct 16, 2017
Duration
3h 30m
Description

Many years ago, hardware cost was pretty steep. It was not unexpected that a project with large amounts of data required 7 figures worth of hardware just to get started. But times have changed, and with cloud services it is possible now to store data cheaply and spin up as many servers with your desired specs to process this data with all kinds of available machines and get the answers that you need. In this course, Deploying Hadoop with Cloudera CDH to AWS, you will learn how to deploy Hadoop in the cloud. First you'll learn about some key topics. Then, you'll learn how to perform deployment manually. Finally, you'll learn about a specialized tool called Cloudera Director that helps automate deployments either for transient or for long running clusters. You will also learn about some differences between AWS and Azure/GCE. These differences can be important if you are working on a different platform, but by no means are they blockers for someone already familiar with their current platform. By the end of this course, you will be able to better manage your cloud needs.

About the author
About the author

Xavier is very passionate about teaching, helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, trainer, and holds a few certifications with Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.

More from the author
Julia: Getting Started
Beginner
2h 4m
Oct 21, 2020
Getting Started with JSON in C# Using Json.NET
Intermediate
3h 47m
Aug 5, 2020
More courses by Xavier Morera