Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, they're still an important part of a Hadoop cluster. Learn how to setup databases for Cloudera CDH and install a production grade cluster.
Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, this does not mean that databases are dead. On the contrary, they are still an important part of a Hadoop cluster and used to store all kinds of information by multiple services. In this course, Preparing a Production Hadoop Cluster with Cloudera: Databases, you'll learn how to setup databases for Cloudera CDH and install a production grade cluster using Cloudera's Installation Path B. First, you'll discover how to select, initialize, and install a supported database. Next, you'll explore how to configure a database with Cloudera's recommended settings, and how to create databases with CDH services. Finally, you'll learn how to complete a CDH deployment. By the end of this course, you'll be able to deploy a production grade cluster.
Xavier is very passionate about teaching, helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, trainer, and holds a few certifications with Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.
Hi everyone, my name is Xavier Morera and welcome to my course, Preparing Databases for Your Production Hadoop Cloudera Cluster. I have been working with search and big data for many years[m2-07@00:48], and now I have the pleasure of being part of your journey into the Hadoop ecosystem with cloudera.
Did you know that Many years ago we used databases to analyze our data, finding trends and discovering insights. But as the size of data grew, at some point databases could not keep up, and so we entered a big data world, primarily with Hadoop, scaling where no database could scale before .
In this course, we are going to learn how to set up the required databases for deploying a Hadoop cluster using clouderas distribution, cdh.
Some of the major topics that we will cover include:
How to select, install and initialize a supported database
Configure a database with Clouderas recommended settings
Create databases for CDH services
And finally, complete a CDH deployment following Clouderas Path B installation method
By the end this course, youll know the steps required to deploy a production grade cluster, using an external database.
Before beginning the course, it is recommended that you are familiar with the basics of Hadoop and Cloudera, but dont worry if you are not, you can still follow along.
I hope youll join me on this journey to learn about databases in Hadoop with the Preparing Databases for Your Production Hadoop Cloudera Cluster course, at Pluralsight.