The MapReduce programming model is the de facto standard for parallel processing of Big Data. This course introduces MapReduce, explains how data flows through a MapReduce program, and guides you through writing your first MapReduce program in Java.
Processing millions of records requires that you first master the art of breaking your tasks down into parallel processes. The MapReduce programming model, part of the Hadoop ecosystem, gives you a framework to define your solution in terms of parallel tasks, which are then combined to produce the final desired result. In this course, Understanding the MapReduce Programming Model, you'll get an introduction to the MapReduce paradigm. First, you'll learn to visualize how data flows through the map, partition, shuffle, and sort phases before it reaches the reduce phase and produces the final result. Next, you'll write your very first MapReduce program in Java. Finally, you'll learn to extend the framework's Mapper and Reducer classes to plug in your own logic, and then run this code on your local machine without using a Hadoop cluster. By the end of this course, you'll be able to break big data problems into parallel tasks and tackle large-scale data munging operations.
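The phase pipeline described above can be sketched in plain Java, with no Hadoop dependency at all. This is not the Hadoop API (Hadoop's Mapper and Reducer classes are only named above, not used here); it is an illustrative simulation of the data flow, using a hypothetical word-count problem: map emits (key, value) pairs, shuffle-and-sort groups the values by key, and reduce aggregates each group.

```java
import java.util.*;

public class WordCountFlow {
    // Map phase: emit a (word, 1) pair for every word in one line of input.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\W+")) {
            if (!word.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle-and-sort phase: group all emitted values by key,
    // with the keys visited in sorted order (TreeMap).
    static SortedMap<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        SortedMap<String, List<Integer>> groups = new TreeMap<>();
        for (Map.Entry<String, Integer> p : pairs) {
            groups.computeIfAbsent(p.getKey(), k -> new ArrayList<>()).add(p.getValue());
        }
        return groups;
    }

    // Reduce phase: sum the grouped counts for each word.
    static Map<String, Integer> reduce(SortedMap<String, List<Integer>> groups) {
        Map<String, Integer> result = new LinkedHashMap<>();
        groups.forEach((word, counts) ->
                result.put(word, counts.stream().mapToInt(Integer::intValue).sum()));
        return result;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> mapped = new ArrayList<>();
        for (String line : List.of("the quick brown fox", "the lazy dog")) {
            mapped.addAll(map(line));
        }
        // Chaining the phases reproduces the MapReduce pipeline locally.
        System.out.println(reduce(shuffle(mapped)));
    }
}
```

In a real Hadoop job the map calls run on different machines and the shuffle moves data across the network, but the contract between the phases is exactly this.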
A problem solver at heart, Janani has a master's degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds four patents for its real-time collaborative editing framework.
Course Overview

Hi! My name is Janani, and I'm very happy to meet you today. I have a master's degree in electrical engineering from Stanford and have worked at companies such as Microsoft, Google, and Flipkart. At Google, I was one of the first engineers working on real-time collaborative editing in Google Docs. And I hold four patents for it on the _____ Technologies. This course, Understanding MapReduce, introduces you to the backbone of big data technologies: the MapReduce programming model. MapReduce is beautiful in its simplicity, but courses on MapReduce tend to focus on the implementation rather than on a meta-level understanding of how data flows and is transformed in the system. MapReduce is the foundation of the art of thinking parallel, and this course helps you develop that art. This course will walk you through what MapReduce is from first principles. It will guide you as you write your first MapReduce Hello World program. And then, finally, it will show you how you can optimize the MapReduce process by taking advantage of the inherent parallelism available in a distributed computing environment.
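One way to see the inherent parallelism the overview mentions, without standing up a Hadoop cluster, is Java's parallel streams. This is a local analogy, not the Hadoop runtime, and the class and method names here are illustrative: each input "split" is mapped independently on worker threads, the concurrent grouping collector plays the role of the shuffle, and the counting collector is the reduce step.

```java
import java.util.*;
import java.util.stream.*;

public class ParallelWordCount {
    // Word count over a list of input splits, with map tasks run in parallel.
    static Map<String, Long> count(List<String> splits) {
        return splits.parallelStream()                                   // parallel "map tasks"
                .flatMap(line -> Arrays.stream(line.toLowerCase().split("\\W+")))
                .filter(w -> !w.isEmpty())
                .collect(Collectors.groupingByConcurrent(                // the "shuffle"
                        w -> w,
                        Collectors.counting()));                         // the "reduce"
    }

    public static void main(String[] args) {
        System.out.println(count(List.of("to be or not to be", "to do is to be")));
    }
}
```

Because each split is processed independently before the grouping step, adding more splits (or more machines, in real Hadoop) scales the map phase almost linearly; that independence is the parallelism MapReduce is designed to exploit.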