Understanding Apache Iceberg Architecture

You’re familiar with the “what” and “why” of Apache Iceberg. This internals course will focus on the “how” of Apache Iceberg: specifically, how it is able to accomplish what it does as an open-storage format powering petabyte-scale data lakes.

by Russ Thomas

What you'll learn

Many courses and white papers on Apache Iceberg do a good job of describing what it is and why you might consider it as a storage format for your data lake or lakehouse platform.

In this course, Understanding Apache Iceberg Architecture, you’ll gain an understanding of the internals of Apache Iceberg.

First, you’ll explore Apache Iceberg’s layered approach to data and metadata storage and management, and how Iceberg is able to fully decouple storage from the query engine and catalog.

Next, you’ll discover how Iceberg handles schema evolution gracefully while maintaining full ACID compliance, allowing data engineers to work with a petabyte-scale platform as they would a traditional data platform like SQL Server or Oracle.

Finally, you’ll learn how Iceberg manages and facilitates storage and query optimizations such as partitioning, compaction, and garbage collection.

When you’re finished with this course, you’ll have the skills and knowledge of Apache Iceberg architecture needed to fully assess and support it in your own data project.

About the author

Russ Thomas

Currently an IT leader in the financial sector in Denver, Colorado, Russ has focused on database development, modelling, administration, and BI across the Microsoft stack since 1997. A passionate trainer and SQL community volunteer, Russ presents regularly at PASS SQL Saturday events and local user groups around the US.
