Featured resource
2025 Tech Upskilling Playbook
Tech Upskilling Playbook

Build future-ready tech teams and hit key business milestones with seven proven plays from industry leaders.

Check it out
  • Lab
    • Libraries: If you want this lab, consider one of these libraries.
    • AI
    • Cloud
Google Cloud Platform icon
Labs

Training Reports Utilized in SageMaker Debugger to Improve Your Models

Imagine you are the data engineer at your company, and your company has just selected AWS as the preferred cloud provider. You have been given a dataset to predict if an individual makes more than $50K in salary. As part of the modeling process, you have been asked to generate a summary of the model training evaluation results, insights into the model performance, and interactive graphs. In this lab, you will fetch the census data and use that as the input dataset. Once the data is split, the data is uploaded to the S3 bucket. Then, the Sagemaker estimator is configured with the debugger hook and Sagemaker built-in rules to generate performance metric reports.

Google Cloud Platform icon
Lab platform
Lab Info
Level
Intermediate
Last updated
Sep 24, 2025
Duration
1h 0m

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
Table of Contents
  1. Challenge

    Launch SageMaker Notebook
    1. Log in to the AWS console and navigate to AWS SageMaker.
    2. Load the Jupyter Notebook that has been provided with this hands-on lab.
  2. Challenge

    Install dependencies and import the libraries
    1. Load required libraries.
    2. Fetch the census dataset from shap library.
  3. Challenge

    Download the data and upload them to S3 bucket
    1. Convert the downloaded data to a CSV format
    2. Use the upload_file function and upload the CSV files to S3 bucket
  4. Challenge

    Set up training and validation data
    1. Create the inputs for the fit() function with the training and validation location.
  5. Challenge

    Configure and run the estimator
    1. Fetch the xgboost image according to the region.
    2. Setup the estimator function with the debugger hook and debugger built-in rules.
    3. Initiate the training process.
  6. Challenge

    View the generated reports
    1. Fetch the training and profile reports from S3 bucket.
    2. Review the graphs generated by the training report.
    3. Understand the resource utilization statistics generated by the profiling report.
About the author

Pluralsight Skills gives leaders confidence they have the skills needed to execute technology strategy. Technology teams can benchmark expertise across roles, speed up release cycles and build reliable, secure products. By leveraging our expert content, skill assessments and one-of-a-kind analytics, keep up with the pace of change, put the right people on the right projects and boost productivity. It's the most effective path to developing tech skills at scale.

Real skill practice before real-world application

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Learn by doing

Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.

Follow your guide

All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.

Turn time into mastery

On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.

Get started with Pluralsight