Featured resource
2026 Tech Forecast
2026 Tech Forecast

Stay ahead of what’s next in tech with predictions from 1,500+ business leaders, insiders, and Pluralsight Authors.

Get these insights
  • Course

Real-time Stream Processing with PySpark

Apache Spark is the most widely used analytics engine for large-scale data processing. This course will teach you how to process real-time data streams and productionize real-time data applications.

Intermediate
1h 4m
(0)

Created by Ivan Gavryliuk

Last Updated Apr 17, 2025

Course Thumbnail
  • Course

Real-time Stream Processing with PySpark

Apache Spark is the most widely used analytics engine for large-scale data processing. This course will teach you how to process real-time data streams and productionize real-time data applications.

Intermediate
1h 4m
(0)

Created by Ivan Gavryliuk

Last Updated Apr 17, 2025

Get started today

Access this course and other top-rated tech content with one of our business plans.

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

This course is included in the libraries shown below:

  • Data
What you'll learn

Handling real-time data streams is crucial for modern applications, but many find it challenging to process and analyze data efficiently as it arrives.

In this course, Real-time Stream Processing with PySpark, you’ll gain the ability to build and deploy scalable, real-time data applications using Apache Spark and Python.

First, you’ll explore the fundamentals of the modern Spark Streaming and structured streaming concepts.

Next, you’ll discover advanced streaming techniques, such as window operations, stateful transformations, and fault tolerance, to enhance the reliability and performance of your applications.

Finally, you’ll learn how to integrate PySpark with various data sources and sinks, enabling seamless data ingestion and output to and from your streaming applications.

When you’re finished with this course, you’ll have the skills and knowledge of stream processing with PySpark needed to develop robust, real-time data processing systems that can handle large-scale data streams efficiently.

Real-time Stream Processing with PySpark
Intermediate
1h 4m
(0)
Table of contents

About the author
Ivan Gavryliuk - Pluralsight course - Real-time Stream Processing with PySpark
Ivan Gavryliuk
5 courses 4.6 author rating 364 ratings

Ivan is a technical architect and cloud consultant with over 15 years of experience in building software for start-ups and enterprises. He is a blogger, father, thinker, and cyclist.

Get started with Pluralsight