Simple play icon Course
Skills

Getting Started with HBase: The Hadoop Database

by Janani Ravi

As the data you store expands in size, traditional relational databases may no longer work. HBase has the ability to deal with billions of rows of data and each record can contains millions of fields. This course will help you get started with HBase.

What you'll learn

Billions of records with millions of fields of semi-structured, unformatted data is the reality of the kind of data we are storing today. Traditional databases are bound by strict data layout requirements and constraints that, unfortunately, do not scale to meet big data requirements. HBase reimagines how data can be stored in a distributed system. This course, Getting Started with HBase: The Hadoop Database, teaches you how to use HBase from the start to finish. First, you'll learn how to design and layout data in a columnar format in order to optimize disk seeks and reduce read latency. Next, you'll learn how to manipulate and access this data using the command line HBase shell as well as the HBase Java API. Finally, you'll learn to process this data by performing complex aggregation and grouping operations using the MapReduce programming model with HBase. By the end of this course, you'll be ready to start making your data much more manageable using HBase.

Table of contents

About the author

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ... more

Ready to upskill? Get started