As the data you store grows, traditional relational databases may no longer be able to keep up. HBase can handle billions of rows of data, and each record can contain millions of fields. This course will help you get started with HBase.
Datasets with billions of records and millions of fields of semi-structured, unformatted data are the reality of what we store today. Traditional databases are bound by strict data layout requirements and constraints that, unfortunately, do not scale to meet big data requirements. HBase reimagines how data can be stored in a distributed system. This course, Getting Started with HBase: The Hadoop Database, teaches you how to use HBase from start to finish. First, you'll learn how to design and lay out data in a columnar format in order to optimize disk seeks and reduce read latency. Next, you'll learn how to manipulate and access this data using the command-line HBase shell as well as the HBase Java API. Finally, you'll learn to process this data by performing complex aggregation and grouping operations using the MapReduce programming model with HBase. By the end of this course, you'll be ready to start making your data much more manageable using HBase.
A problem solver at heart, Janani has a master's degree from Stanford and worked at Google for 7+ years. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.