Preparing a Production Hadoop Cluster with Cloudera: Databases

Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, they're still an important part of a Hadoop cluster. Learn how to setup databases for Cloudera CDH and install a production grade cluster.
Course info
Rating
(14)
Level
Beginner
Updated
Jun 5, 2017
Duration
1h 31m
Table of contents
Course Overview
Understanding Databases in Big Data and with Cloudera CDH
Setting up a Production Database for Your Hadoop Cluster
Configuring Your MySQL Database for Cloudera Manager (Path B)
Preparing Your Databases and Deploying CDH
Preparing Your Database for High Availability
Final Takeaway
Description
Course info
Rating
(14)
Level
Beginner
Updated
Jun 5, 2017
Duration
1h 31m
Description

Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, this does not mean that databases are dead. On the contrary, they are still an important part of a Hadoop cluster and used to store all kinds of information by multiple services. In this course, Preparing a Production Hadoop Cluster with Cloudera: Databases, you'll learn how to setup databases for Cloudera CDH and install a production grade cluster using Cloudera's Installation Path B. First, you'll discover how to select, initialize, and install a supported database. Next, you'll explore how to configure a database with Cloudera's recommended settings, and how to create databases with CDH services. Finally, you'll learn how to complete a CDH deployment. By the end of this course, you'll be able to deploy a production grade cluster.

About the author
About the author

Xavier is very passionate about teaching, helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, trainer, and holds a few certifications with Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.

More from the author
T-SQL Data Manipulation Playbook
Intermediate
2h 54m
Sep 27, 2019
Programming Python Using an IDE
Intermediate
2h 0m
Jun 26, 2019
More courses by Xavier Morera
Section Introduction Transcripts
Section Introduction Transcripts

Course Overview
Hi everyone. My name is Xavier Morera, and welcome to my course, Preparing Databases for Your Production Hadoop Cloudera Cluster. I have been working with search and big data for many years, and now I have the pleasure of being part of your journey into the Hadoop ecosystem with Cloudera. Did you know that many years ago we used databases to analyze our data, finding trends, and discovering insights? But as the size of the data grew, at some point databases could not keep up, and so we entered a big data world, primarily with Hadoop, scaling where no database could scale before. In this course, we're going to learn how to set up the required databases for deploying a Hadoop cluster using Cloudera's distribution, CDH. Some of the topics that we will cover include how to select, install, and initialize a supported database, configure a database with Cloudera's recommended settings, create databases for CDH services, and finally complete a CDH deployment following Cloudera's Path B installation method. By the end of this course, you will know the steps required to deploy a production grade cluster using an external database. And before beginning this course, it is recommended that you are familiar with the basics of Hadoop and Cloudera. But don't worry if you are not. You can still follow along and learn the process. I hope you'll join me on this journey to learn about databases in Hadoop with the Preparing Databases for Your Production Hadoop Cloudera Cluster course at Pluralsight.