Preparing Data for Feature Engineering and Machine Learning
By Janani Ravi
Course info



Course info



Description
However well designed and well implemented a machine learning model is, if the data fed in is poorly engineered, the model’s predictions will be disappointing.
In this course, Preparing Data for Feature Engineering and Machine Learning, you will gain the ability to appropriately pre-process your data -- in effect engineer it -- so that you can get the best out of your ML models.
First, you will learn how feature selection techniques can be used to find predictors that contain the most information. Feature selection can be broadly grouped into three categories known as filter, wrapper, and embedded techniques and we will understand and implement all of these.
Next, you will discover how feature extraction differs from feature selection, in that data is substantially re-expressed, sometimes in forms that are hard to interpret. You will then understand techniques for feature extraction from image and text data.
Finally, you will round out your knowledge by understanding how to leverage powerful Python libraries for working with images, text, dates, and geo-spatial data.
When you’re finished with this course, you will have the skills and knowledge to identify the correct feature engineering techniques, and the appropriate solutions for your use-case.
Section Introduction Transcripts
Course Overview
Hi, my name is Janani Ravi, and welcome to this course on Preparing Data for Feature Engineering and Machine Learning. A little about myself, I have a master's degree in electrical engineering from Stanford and have worked at companies such as Microsoft, Google, and Flipkart. At Google, I was one of the first engineers working on real time collaborative editing in Google Docs and I hold four patents for its underlying technologies. I currently work on my own startup, Loonycorn, a studio for high quality video content. However well designed and well implemented a machine learning model is, if the data fed in is poorly engineered, the model's predictions will be disappointing. In this course, you will gain the ability to appropriately pre-process your data, in effect engineer it so that you can get the best out of your ML models. First, you will learn how feature selection techniques can be used to find predictors that contain the most information. Feature selection can be broadly grouped into three categories known as filter, wrapper, and embedded techniques, and we will understand and implement all of these. Next, you will discover how feature extraction defers from feature selection in that data is substantially re-expressed, sometimes in forms that are hard to interpret. You will then understand techniques for feature extraction from image and text data. Finally, you will round out your knowledge by understanding how to leverage powerful Python libraries for working with images, text, dates, and geospatial data. When you're finished with this course, you will have the skills and knowledge to identify the correct feature engineering techniques and the appropriate solutions for your use case.