Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, databases are still an important part of a Hadoop cluster. Learn how to set up databases for Cloudera CDH and install a production-grade cluster.
Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, this does not mean that databases are dead. On the contrary, they are still an important part of a Hadoop cluster and are used by multiple services to store all kinds of information. In this course, Preparing a Production Hadoop Cluster with Cloudera: Databases, you'll learn how to set up databases for Cloudera CDH and install a production-grade cluster using Cloudera's Installation Path B. First, you'll discover how to select, install, and initialize a supported database. Next, you'll explore how to configure a database with Cloudera's recommended settings, and how to create databases for CDH services. Finally, you'll learn how to complete a CDH deployment. By the end of this course, you'll be able to deploy a production-grade cluster.
Xavier is very passionate about teaching and helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, and trainer, and he holds a few certifications from Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.
Course Overview
Hi everyone. My name is Xavier Morera, and welcome to my course, Preparing Databases for Your Production Hadoop Cloudera Cluster. I have been working with search and big data for many years, and now I have the pleasure of being part of your journey into the Hadoop ecosystem with Cloudera. Did you know that many years ago we used databases to analyze our data, finding trends and discovering insights? But as the size of the data grew, at some point databases could not keep up, and so we entered a big data world, primarily with Hadoop, scaling where no database could scale before. In this course, we're going to learn how to set up the required databases for deploying a Hadoop cluster using Cloudera's distribution, CDH. Some of the topics that we will cover include how to select, install, and initialize a supported database, configure a database with Cloudera's recommended settings, create databases for CDH services, and finally complete a CDH deployment following Cloudera's Path B installation method. By the end of this course, you will know the steps required to deploy a production-grade cluster using an external database. Before beginning this course, it is recommended that you be familiar with the basics of Hadoop and Cloudera. But don't worry if you are not. You can still follow along and learn the process. I hope you'll join me on this journey to learn about databases in Hadoop with the Preparing Databases for Your Production Hadoop Cloudera Cluster course at Pluralsight.