Featured resource
2026 Tech Forecast
2026 Tech Forecast

Stay ahead of what’s next in tech with predictions from 1,500+ business leaders, insiders, and Pluralsight Authors.

Get these insights
  • Course

Data Preparation and Exploration in Databricks

Are you planning for large-scale data preparation and analysis? This course will teach you Databricks - how to use it to explore, analyze, clean and transform data; store data in Delta format; and visualize it using Databricks charts and dashboards.

Beginner
1h 32m
(8)

Created by Mohit Batra

Last Updated Mar 19, 2025

Course Thumbnail
  • Course

Data Preparation and Exploration in Databricks

Are you planning for large-scale data preparation and analysis? This course will teach you Databricks - how to use it to explore, analyze, clean and transform data; store data in Delta format; and visualize it using Databricks charts and dashboards.

Beginner
1h 32m
(8)

Created by Mohit Batra

Last Updated Mar 19, 2025

Get started today

Access this course and other top-rated tech content with one of our business plans.

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

This course is included in the libraries shown below:

  • Data
What you'll learn

Databricks is a unified analytics platform that can handle huge volumes of data, process it faster, and help explore and analyze the data in-depth.

In this course, Data Preparation and Exploration in Databricks, you’ll gain the ability to explore, analyze, clean, and transform data using the Databricks platform; store the processed data in Delta Lake format; and visualize the data using Databricks charts and dashboards. First, you’ll learn how to set up the Databricks environment. Then, you’ll discover how to extract data from multiple sources in Databricks - like Azure Data Lake Store and Databricks File System (DBFS) - and create Spark DataFrames. Next, you’ll go over how to explore and analyze data using Spark and Databricks features, clean and transform the data using Spark, and visualize the data using Databricks charts and dashboards.

Following this, you’ll see how to store the processed data in Data Lake or as Databricks Tables in Delta Lake format. Finally, you’ll learn how to do performance optimization in Databricks. When you’re finished with this course, you’ll have the skills and knowledge of Databricks needed to prepare and explore the data.

Data Preparation and Exploration in Databricks
Beginner
1h 32m
(8)
Table of contents

About the author
Mohit Batra - Pluralsight course - Data Preparation and Exploration in Databricks
Mohit Batra
14 courses 4.7 author rating 871 ratings

Mohit is a Data Engineer, a Microsoft Certified Trainer (MCT) and a consultant. Mohit has 15+ years of extensive experience in architecting large scale Business Intelligence, Data Warehousing and Big Data solutions with companies like Microsoft and some leading investment banks.

Get started with Pluralsight