- Lab
-
Libraries: If you want this lab, consider one of these libraries.
- Cloud

Cleanse Outlying Data Using the pandas Python Package
In this lab, we will load a CSV file into a pandas DataFrame. Once loaded, we will remove rows with an `age` more than 3 standard deviations from the mean and rows with `hours-per-week` below the 10% and above the 90% quantiles. We will then write the cleansed data to a file. Basic Python programming skills will be required for this lab. If you need a refresher, check out the following course: - [Certified Associate in Python Programming Certification](https://acloud.guru/overview/8169e8e7-91a7-4d92-b278-4dd08c787dc6)

Lab Info
Table of Contents
-
Challenge
Load the Data File
Load the
data.csv
file into a pandas DataFrame. -
Challenge
Resolve Outlying age Values
Remove rows with an
age
more than 3 standard deviations from the mean. -
Challenge
Resolve Outlying hours-per-week Values
Remove rows with
hours-per-week
below the 10% and above the 90% quantiles. -
Challenge
Write the Data to a New File
Write the data to a new file named
cleaned_data.csv
.
About the author
Real skill practice before real-world application
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Learn by doing
Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.
Follow your guide
All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.
Turn time into mastery
On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.