Featured resource
2025 Tech Upskilling Playbook
Tech Upskilling Playbook

Build future-ready tech teams and hit key business milestones with seven proven plays from industry leaders.

Check it out
  • Course

Coping with Missing, Invalid, and Duplicate Data in R

Learn about the most essential steps of data preparation: Missing value imputation, outlier detection, and duplicate removal.

Intermediate
2h 1m
(10)

Created by Martin Burger

Last Updated Mar 31, 2025

Course Thumbnail
  • Course

Coping with Missing, Invalid, and Duplicate Data in R

Learn about the most essential steps of data preparation: Missing value imputation, outlier detection, and duplicate removal.

Intermediate
2h 1m
(10)

Created by Martin Burger

Last Updated Mar 31, 2025

Get started today

Access this course and other top-rated tech content with one of our business plans.

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

This course is included in the libraries shown below:

  • Data
What you'll learn

Data preparation is part of nearly any data analytics project, therefore the skills are highly valuable. In this course, Coping with Missing, Invalid, and Duplicate Data in R, you will learn the main steps of data preparation. First, you will learn how to handle duplicate data. Next, you will discover that missing values prevent a lot of R functions from working properly, therefore you are limited in your R toolset as long as you do not take care of all these NA's. Finally, you will explore outlier and invalid data detection and how they can introduce bias into your analysis. When you’re finished with this course, you will understand why missing values, outliers, and duplicates are problematic, how to detect them, and how to remove them from the dataset.

Coping with Missing, Invalid, and Duplicate Data in R
Intermediate
2h 1m
(10)
Table of contents

About the author
Martin Burger - Pluralsight course - Coping with Missing, Invalid, and Duplicate Data in R
Martin Burger
18 courses 4.6 author rating 181 ratings

Martin is a trained biostatistician, programmer, consultant and data science enthusiast. His main objective: Explaining data science in a straightforward way. You can find his latest work over at: r-tutorials.com

2025 Forrester Wave™ names Pluralsight as a Leader among tech skills dev platforms

See how our offering and strategy stack up.

forrester wave report