Course

Skills

Performing Dimension Analysis with R

by Janani Ravi

This course covers virtually all of the important techniques of dimensionality reduction available in R, allowing model builders to optimize model performance by reducing overfitting and saving on model training time and cost.

Preview this course

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Level

Intermediate

Updated

Mar 20, 2020

Duration

2h 11m

What you'll learn

Dimensionality Reduction is a powerful and versatile unsupervised machine learning technique that can be used to improve the performance of virtually every ML model. Using dimensionality reduction, you can significantly speed up model training and validation, saving both time and money, as well as greatly reducing the risk of overfitting.

In this course, Performing Dimension Analysis with R, you will gain the ability to design and implement an exhaustive array of feature selection and dimensionality reduction techniques in R. First, you will learn the importance of dimensionality reduction and understand the pitfalls of working with data of excessively high-dimensionality, often referred to as the curse of dimensionality. Next, you will discover how to implement simple feature selection techniques to decide which subset of the existing features we might choose to use while losing as little information from the original, full dataset as possible.

You will then learn important techniques for reducing dimensionality in linear data. Such techniques, notably Principal Components Analysis and Linear Discriminant Analysis, seek to re-orient the original data using new, optimized axes. The choice of these axes is driven by numeric procedures such as Eigenvalue and Singular Value Decomposition.

You will then move to dealing with manifold data, which is non-linear and often takes the form of Swiss rolls and S-curves. Such data presents an illusion of complexity but is actually easily simplified by unrolling the manifold.

Finally, you will explore how to implement a wide variety of manifold learning techniques including multi-dimensional scaling (MDS), Isomap, and t-distributed Stochastic Neighbor Embedding (t-SNE). You will round out the course by comparing the results of these manifold unrolling techniques with artificially generated data. When you are finished with this course, you will have the skills and knowledge of Dimensionality Reduction needed to design and implement ways to mitigate the curse of dimensionality in R.

Course Overview

2mins

Course Overview 2m

Understanding the Importance of Reducing Complexity in Data

44mins

Performing Dimensional Analysis for Continuous Data

31mins

Principal Components Analysis 7m
Partial Least Squares Regression 4m
Demo: Performing Principal Component Analysis 6m
Demo: Regression Using Principal Components 5m
Demo: Principal Components Regression 5m
Demo: Partial Least Squares Regression 4m

Performing Dimensional Analysis for Categorical Data

24mins

Linear Discriminant Analysis and Quadratic Discriminant Analysis 7m
Demo: Exploring and Preparing Data for Linear Discriminant Analysis 7m
Demo: Performing Linear Discriminant Analysis on Glass Data 8m
Demo: Performing Quadratic Discriminant Analysis on Glass Data 4m

Performing Dimensional Analysis for Non-linear Data

29mins

Manifold Learning 8m
Demo: Manifold Learning Using Metric and Non-metric MDS 8m
Demo: Manifold Learning Using t-SNE, Isomap, and LLE 6m
Demo: Kernel PCA for Non-linearly Separable Data 5m
Module Summary 2m

About the author

Janani Ravi

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ... more

See more courses by Janani Ravi

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Level

Intermediate

Updated

Mar 20, 2020

Duration

2h 11m

Ready to upskill? Get started

Contact Sales

Performing Dimension Analysis with R

What you'll learn

Table of contents

About the author

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Contact Sales

Performing Dimension Analysis with R

What you'll learn

Table of contents

About the author

Get access now

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Ready to skill up
your entire team?

Ready to skill up
your entire team?