Libraries: If you want this lab, consider one of these libraries.
Data

SQL Fundamentals: Grouping and Summarizing Data

In this lab, you learned how to group and summarize data using the `GROUP BY` and `HAVING` clauses, and how to retrieve distinct values with the `DISTINCT` keyword. These techniques are essential for analyzing and understanding large datasets, enabling you to extract meaningful insights and make data-driven decisions. By mastering these skills, you are well-prepared to tackle more complex SQL queries and contribute to effective data analysis in any organization.

Get started Contact sales

Lab Info

Level

Intermediate

Last updated

Jan 06, 2026

Duration

30m

Challenge

Grouping Data with GROUP BY

SQL Guide

For each task, replace or update the existing SQL code in the top pane (main.sql) with your answer to the task and run it to see the results in the bottom pane (SQL Viewer).

Step 1: Grouping Data with GROUP BY

To review the concepts covered in this step, please refer to the course Summarizing Data with GROUP BY module of the Introduction to SQL Course.

Grouping data is a powerful feature in SQL that allows you to perform calculations and summarize data within groups of rows. The GROUP BY clause is essential when you want to aggregate data across multiple records, such as counting the number of products in each category or calculating the average salary in each department. In this step, you will learn how to group data effectively and apply aggregate functions to generate meaningful summaries.

🫙Database Structure

Tables You Will Be Working With:

Product Table:

| Column Name | Data Type | |-------------|-------------------| | ProductID | INT (Primary Key) | | ProductName | VARCHAR(255) | | Category | VARCHAR(100) | | ListPrice | DECIMAL(10,2) |

Employee Table:

| Column Name | Data Type | |--------------|-------------------| | EmployeeID | INT (Primary Key) | | FirstName | VARCHAR(100) | | LastName | VARCHAR(100) | | Salary | DECIMAL(10,2) |

UserSession Table:

| Column Name | Data Type | |----------------|-------------------| | SessionID | INT (Primary Key) | | UserID | INT | | MinutesWatched | INT |

📚 Real-World Context
Imagine you are working as a data analyst at an e-commerce company. Your team needs to understand the distribution of products across different categories to optimize inventory and marketing strategies. By grouping products using the `GROUP BY` clause, you can easily count how many products are in each category, helping your team make informed decisions on where to focus their efforts. Similarly, if you need to report on sales performance, grouping sales data by region allows you to see which areas are performing well and which may need more attention. Understanding how to group data and calculate averages, totals, or counts is a fundamental skill in data analysis, enabling you to turn raw data into actionable insights.
Challenge

Filtering Groups with HAVING

Step 2: Filtering Groups with HAVING

Once you've grouped data using the GROUP BY clause, there may be times when you need to filter these groups based on certain conditions. The HAVING clause in SQL allows you to apply filters to groups after the aggregation has been performed, making it a powerful tool for refining your query results. In this step, you’ll learn how to use the HAVING clause to filter grouped data, ensuring that only the most relevant information is included in your output.

🫙Database Structure

Tables You Will Be Working With:

Product Table:

| Column Name | Data Type | |-------------|-------------------| | ProductID | INT (Primary Key) | | ProductName | VARCHAR(255) | | Category | VARCHAR(100) | | ListPrice | DECIMAL(10,2) |

Employee Table:

| Column Name | Data Type | |--------------|-------------------| | EmployeeID | INT (Primary Key) | | FirstName | VARCHAR(100) | | LastName | VARCHAR(100) | | Salary | DECIMAL(10,2) |

UserSession Table:

| Column Name | Data Type | |----------------|-------------------| | SessionID | INT (Primary Key) | | UserID | INT | | MinutesWatched | INT |

📚 Real-World Context
As a data professional, you often need to focus on the most significant data points within a dataset. For example, if you're tasked with identifying key product categories that drive the most revenue, you don't want to include categories with minimal sales. The `HAVING` clause allows you to filter your grouped data to focus only on the groups that meet specific criteria, such as categories with more than 100 products or regions with sales exceeding 10,000. This capability is essential in business intelligence, where targeted analysis can lead to more strategic decisions and better resource allocation. Whether you're managing budgets, analyzing sales, or evaluating department performance, filtering grouped data with `HAVING` helps you concentrate on what matters most.
Challenge

Distinct Values

Step 3: Distinct Values

In SQL, retrieving distinct values from a table is crucial when you need to eliminate duplicates and understand the unique elements within your data. The DISTINCT keyword is used to return only unique values from a column, ensuring that each value appears only once in the results. In this step, you will learn how to use DISTINCT to retrieve unique values from various columns in your database.

🫙Database Structure

Tables You Will Be Working With:

Product Table:

| Column Name | Data Type | |-------------|-------------------| | ProductID | INT (Primary Key) | | ProductName | VARCHAR(255) | | Category | VARCHAR(100) | | ListPrice | DECIMAL(10,2) |

Employee Table:

| Column Name | Data Type | |--------------|-------------------| | EmployeeID | INT (Primary Key) | | FirstName | VARCHAR(100) | | LastName | VARCHAR(100) | | Salary | DECIMAL(10,2) |

UserSession Table:

| Column Name | Data Type | |----------------|-------------------| | SessionID | INT (Primary Key) | | UserID | INT | | MinutesWatched | INT |

📚 Real-World Context
When managing large datasets, it's common to encounter duplicate entries that can obscure the unique values within your data. For instance, if you're preparing a report on customer demographics, you need to know all the distinct countries your customers are from, rather than seeing the same country listed multiple times. The `DISTINCT` keyword in SQL allows you to retrieve only the unique values, providing a clearer picture of the diversity in your data. Whether you're compiling a list of job titles within a company, identifying unique product categories, or analyzing customer locations, understanding how to use `DISTINCT` helps you accurately represent the data and avoid redundancy in your reports and analyses.

About the author

Real skill practice before real-world application

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Learn by doing

Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.

Follow your guide

All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.

Turn time into mastery

On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.

SQL Fundamentals: Grouping and Summarizing Data

Lab Info

Table of Contents

Grouping Data with GROUP BY

SQL Guide

Step 1: Grouping Data with GROUP BY

Tables You Will Be Working With:

Filtering Groups with HAVING

Step 2: Filtering Groups with HAVING

Tables You Will Be Working With:

Distinct Values

Step 3: Distinct Values

Tables You Will Be Working With:

About the author

Real skill practice before real-world application

Learn by doing

Follow your guide

Turn time into mastery

Get started with Pluralsight