Course

Skills

Preparing Data for Feature Engineering and Machine Learning

by Janani Ravi

This course covers categories of feature engineering techniques used to get the best results from a machine learning model, including feature selection, and several feature extraction techniques to re-express features in the most appropriate form.

Preview this course

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Rating

(38)

Level

Beginner

Updated

Feb 5, 2020

Duration

3h 17m

What you'll learn

However well designed and well implemented a machine learning model is, if the data fed in is poorly engineered, the model’s predictions will be disappointing.

In this course, Preparing Data for Feature Engineering and Machine Learning, you will gain the ability to appropriately pre-process your data -- in effect engineer it -- so that you can get the best out of your ML models.

First, you will learn how feature selection techniques can be used to find predictors that contain the most information. Feature selection can be broadly grouped into three categories known as filter, wrapper, and embedded techniques and we will understand and implement all of these.

Next, you will discover how feature extraction differs from feature selection, in that data is substantially re-expressed, sometimes in forms that are hard to interpret. You will then understand techniques for feature extraction from image and text data.

Finally, you will round out your knowledge by understanding how to leverage powerful Python libraries for working with images, text, dates, and geo-spatial data.

When you’re finished with this course, you will have the skills and knowledge to identify the correct feature engineering techniques, and the appropriate solutions for your use-case.

Course Overview

1min

Course Overview 2m

Understanding the Role of Features in Machine Learning

37mins

Preparing Data for Machine Learning

43mins

Module Overview 2m
Problems with Data 4m
Dealing with Missing Values 5m
Dealing with Outliers 6m
Applying Different Techniques to Handle Missing Values 7m
Detecting and Handling Outliers 7m
Reading and Exploring the Dataset 8m
Perform Simple and Multiple Linear Regression 4m
Module Summary 1m

Understanding and Implementing Feature Selection

49mins

Module Overview 2m
Types of Data 5m
Measuring Correlations 5m
Understanding Feature Selection Using Filter, Embedded, and Wrapper Methods 6m
Feature Selection Using Missing Value Ratio 5m
Calculating and Visualizing Correlations Using Pandas 6m
Calculating and Visualizing Correlations Using Yellowbrick 3m
Feature Selection Using Filter Methods 6m
Feature Selection Using Wrapper Methods 6m
Feature Selection Using Embedded Methods 5m
Module Summary 2m

Exploring Feature Extraction Techniques

18mins

Module Overview 1m
Representing Images as Matrices and Image Preprocessing Techniques 5m
Feature Detection and Extraction from Images 5m
Feature Extraction from Text 6m
Module Summary 1m

Implementing Feature Extraction

47mins

Module Overview 1m
Tokenization and Visualizing Frequency Distributions 4m
Performing Normalization Using Different Techniques 5m
Creating Feature Vectors from Text Data 6m
Loading and Transforming Images 5m
Extracting Features from Images 3m
Detecting Keypoints and Descriptors to Perform Image Matching 5m
Extracting Text from Images Using OCR 4m
Extracting Features from Dates 5m
Working with Geospatial Features 7m
Summary and Further Study 2m

About the author

Janani Ravi

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ... more

See more courses by Janani Ravi

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Rating

(38)

Level

Beginner

Updated

Feb 5, 2020

Duration

3h 17m

Ready to upskill? Get started

Contact Sales

Preparing Data for Feature Engineering and Machine Learning

What you'll learn

Table of contents

About the author

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Contact Sales

Preparing Data for Feature Engineering and Machine Learning

What you'll learn

Table of contents

About the author

Get access now

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Ready to skill up
your entire team?

Ready to skill up
your entire team?