Data Wrangling with R

Paths

Data Wrangling with R

Authors: Dan Tofan, Mariah Weatherford

Data wrangling is the process of transforming and mapping data from one form into another, with the intent of making it more available for data analytics. This skill teaches... Read more

What You Will Learn

  • Data Wrangling with R conveys commonly performed data wrangling and transformation patterns using R.

Pre-requisites

  • Data Literacy
  • Mathematics
  • bR for Data Analysts
  • Visualizing Data with R
  • Importing Data with R

Beginner

You’ll learn different ways to combine, group and merge your data.

Merging Data Sources with R

by Dan Tofan

Jul 31, 2019 / 1h 28m

1h 28m

Start Course
Description

In your R data science projects, you need very often to work with data which is spread out across multiple data sources. For example, given two separate data sets on products and their sales, how can you merge them into a new data set? In this course, Merging Data Sources with R, you will gain the ability to merge data from different sources in a controlled way that enables you to keep only the data you need. First, you will learn to merge vectors, which includes using the paste() and append() methods. Next, you will discover how to join data sets with the merge() function, which includes left, inner, right and full outer joins, on data frames that can have one-to-one, one-to-many, or many-to-many relationships. Finally, you will explore how to join data sets with the dplyr package, which covers the previous joins plus anti and semi joins. When you are finished with this course, you will have the skills and knowledge of merging data from different sources, needed to do data wrangling with R.

Table of contents
  1. Course Overview
  2. Managing Vectors
  3. Joining Data Sets with the Merge() Function
  4. Joining Data Sets with dplyr

Splitting and Combining Data with R

by Mariah Weatherford

Oct 25, 2019 / 1h 57m

1h 57m

Start Course
Description

Summarizing statistics across groups is invaluable for comparing categories of observations. In this course, Splitting and Combining Data with R, you'll explore splitting data into groups based on some criteria, applying functions or calculations to each group independently, and combining the results into a data structure. To begin, you’ll learn how to create custom categorical variables for grouping, and custom numeric variables to which you can apply functions. Next, with the criteria for grouping created, you will split the data, apply functions, and combine the data into a data structure. Finally, with the raw data transformed, you’ll discover how a grouped dataframe can then be ungrouped with summary statistics maintained, or keep the grouped dataframe intact with plotting functions for visualizing variation between groups. By the end of this course, you’ll have a better understanding of how to use R to build data pipelines with dplyr, manipulate strings and dates for feature engineering, and create customized ggplot charts. .

Table of contents
  1. Course Overview
  2. Creating Variables to Combine/Summarize Data
  3. Grouping and Summarizing Data
  4. Plotting Combined Data
Offer Code *
Email * First name * Last name *
Company
Title
Phone
Country *

* Required field

Opt in for the latest promotions and events. You may unsubscribe at any time. Privacy Policy

By providing my phone number to Pluralsight and toggling this feature on, I agree and acknowledge that Pluralsight may use that number to contact me for marketing purposes, including using autodialed or pre-recorded calls and text messages. I understand that consent is not required as a condition of purchase from Pluralsight.

By activating this benefit, you agree to abide by Pluralsight's terms of use and privacy policy.

I agree, activate benefit