Featured resource
2025 Tech Upskilling Playbook
Tech Upskilling Playbook

Build future-ready tech teams and hit key business milestones with seven proven plays from industry leaders.

Check it out
  • Lab
    • Libraries: If you want this lab, consider one of these libraries.
    • AI
    • Cloud
Google Cloud Platform icon
Labs

Creating a scikit-learn Random Forest Classifier in Amazon SageMaker

Scikit-learn is a great place to start working with machine learning. In this lab, we will use scikit-learn to create a Random Forest Classifier to determine if you prefer cats or dogs. The data set being used is entirely made up, but could easily be swapped with one of your own!

Google Cloud Platform icon
Lab platform
Lab Info
Level
Advanced
Last updated
Sep 22, 2025
Duration
1h 0m

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
Table of Contents
  1. Challenge

    Navigate to the Jupyter Notebook

    Log in to the AWS console and navigate to the AWS SageMaker page. From there, load the Jupyter Notebook that has been provided with this hands-on lab.

  2. Challenge

    Load and Prepare the Data
    1. Load the survey data from data.csv, located beside the notebook.
    2. View the data. Look at both the raw data and statistics for the data.
    3. Change the column data types so the model can understand them.
    4. Split the data into training and testing sets. Use 80% of the data for training.
  3. Challenge

    Train the Random Forest Model
    1. Create a Random Forest Classifier model using scikit-learn.
    2. Train the model using the training data.
  4. Challenge

    Evaluate the Model
    1. Generate predictions for the testing data set.
    2. View the confusion matrix for the predictions.
    3. Calculate the sensitivity and specificity.
    4. Plot the ROC curve.
    5. Calculate the area under the curve.
  5. Challenge

    Predict for Yourself
    1. Create a survey response for yourself.
    2. Have the model predict if you prefer cats or dogs.
About the author

Pluralsight Skills gives leaders confidence they have the skills needed to execute technology strategy. Technology teams can benchmark expertise across roles, speed up release cycles and build reliable, secure products. By leveraging our expert content, skill assessments and one-of-a-kind analytics, keep up with the pace of change, put the right people on the right projects and boost productivity. It's the most effective path to developing tech skills at scale.

Real skill practice before real-world application

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Learn by doing

Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.

Follow your guide

All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.

Turn time into mastery

On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.

Get started with Pluralsight