-
Course
- Data
Introduction to Apache Iceberg
Storing petabyte-scale data while remaining flexible with tools, pipelines, and cloud vendors has been an insurmountable challenge. This course will introduce you to a viable solution, Apache Iceberg.
What you'll learn
Vendor lock-in, architectural limitations, rigidity of schema, and partitions are issues faced by almost every table storage format when attempting to store petabyte-scale data.
In this course, Introduction to Apache Iceberg, you’ll gain applicable familiarity with Apache Iceberg, an incredibly flexible table storage format with growing industry support.
First, you’ll explore the common limitations of traditional approaches to table storage, seeing how Iceberg addresses these limitations and opens up options to nearly all the major data platforms.
Next, you’ll discover key features that make Apache Iceberg capable of petabyte-scale data management, such as schema evolution, time travel, and hidden partitions.
Finally, you’ll learn how abstracting your data storage from your query engine offers surprising flexibility and options both in the cost of your data storage as well as the flexibility for sharing data between platforms.
When you’re finished with this course, you’ll have a foundational understanding of Apache Iceberg and understand why it’s becoming such a popular option across many clouds and platforms.
Table of contents
About the author
Currently an IT leader in Denver Colorado's financial sector Russ has focused on database development, modelling, administration, and BI since 1997 across the Microsoft stack. Russ is a passionate trainer and SQL community volunteer presenting regularly at PASS SQL Saturday events and local user groups around the US.
More Courses by Russ