Preparing a Production Hadoop Cluster with Cloudera: Databases

Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, databases are still an important part of a Hadoop cluster. Learn how to set up databases for Cloudera CDH and install a production-grade cluster.
Course info
Level
Beginner
Updated
June 5, 2017
Duration
1h 31m
Description

Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, this does not mean that databases are dead. On the contrary, they are still an important part of a Hadoop cluster and are used by multiple services to store all kinds of information. In this course, Preparing a Production Hadoop Cluster with Cloudera: Databases, you'll learn how to set up databases for Cloudera CDH and install a production-grade cluster using Cloudera's Installation Path B. First, you'll discover how to select, install, and initialize a supported database. Next, you'll explore how to configure a database with Cloudera's recommended settings, and how to create databases for CDH services. Finally, you'll learn how to complete a CDH deployment. By the end of this course, you'll be able to deploy a production-grade cluster.

About the author

Xavier is very passionate about teaching, helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, trainer, and holds a few certifications with Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.

More from the author
Deploying Hadoop with Cloudera CDH to AWS
Intermediate
3h 30m
16 Oct 2017
Transcript

Hi everyone, my name is Xavier Morera and welcome to my course, Preparing Databases for Your Production Hadoop Cloudera Cluster. I have been working with search and big data for many years, and now I have the pleasure of being part of your journey into the Hadoop ecosystem with Cloudera.

Did you know that many years ago we used databases to analyze our data, finding trends and discovering insights? But as the size of data grew, at some point databases could not keep up, and so we entered a big data world, primarily with Hadoop, scaling where no database could scale before.

In this course, we are going to learn how to set up the required databases for deploying a Hadoop cluster using Cloudera’s distribution, CDH.

Some of the major topics that we will cover include:

  • How to select, install, and initialize a supported database
  • How to configure a database with Cloudera’s recommended settings
  • How to create databases for CDH services
  • And finally, how to complete a CDH deployment following Cloudera’s Path B installation method

By the end of this course, you’ll know the steps required to deploy a production-grade cluster using an external database.

Before beginning the course, you should be familiar with the basics of Hadoop and Cloudera, but don’t worry if you are not; you can still follow along.

I hope you’ll join me on this journey to learn about databases in Hadoop with the Preparing Databases for Your Production Hadoop Cloudera Cluster course, at Pluralsight.