Developing Spark Applications Using Scala & Cloudera

Apache Spark is one of the fastest and most efficient general engines for large-scale data processing. In this course, you'll learn how to develop Spark applications for your Big Data using Scala and a stable Hadoop distribution, Cloudera CDH.
Course info
Rating
(35)
Level
Beginner
Updated
May 3, 2018
Duration
5h 41m
Table of contents
Course Overview
Why Spark with Scala and Cloudera?
Getting an Environment and Data: CDH + StackOverflow
Refreshing Your Knowledge: Scala Fundamentals for This Course
Understanding Spark: An Overview
Getting Technical with Spark
Learning the Core of Spark: RDDs
Going Deeper into Spark Core
Increasing Proficiency with Spark: DataFrames and Spark SQL
Continuing the Journey on DataFrames and Spark SQL
Working with a Typed API: Datasets
Final Takeaway and Continuing the Journey with Spark
Description
Course info
Rating
(35)
Level
Beginner
Updated
May 3, 2018
Duration
5h 41m
Description

At the core of working with large-scale datasets is a thorough knowledge of Big Data platforms like Apache Spark and Hadoop. In this course, Developing Spark Applications Using Scala & Cloudera, you’ll learn how to process data at scales you previously thought were out of your reach. First, you’ll learn all the technical details of how Spark works. Next, you’ll explore the RDD API, the original core abstraction of Spark. Then, you’ll discover how to become more proficient using Spark SQL and DataFrames. Finally, you'll learn to work with Spark's typed API: Datasets. When you’re finished with this course, you’ll have a foundational knowledge of Apache Spark with Scala and Cloudera that will help you as you move forward to develop large-scale data applications that enable you to work with Big Data in an efficient and performant way.

About the author
About the author

Xavier is very passionate about teaching, helping others understand search and Big Data. He is also an entrepreneur, project manager, technical author, trainer, and holds a few certifications with Cloudera, Microsoft, and the Scrum Alliance, along with being a Microsoft MVP.

More from the author
Computer Vision: Executive Briefing
Beginner
29m
Apr 1, 2020
More courses by Xavier Morera