Processing Streaming Data Using Apache Flink

Apache Flink is built on the concept of stream-first architecture where the stream is the source of truth.
Course info
Level: Intermediate
Updated: Dec 7, 2020
Duration: 3h 20m
Table of contents
Course Overview
Getting Started with a Standalone Cluster in Flink
Integrating Flink with Apache Kafka
Processing High-velocity Streaming Data Using Windowing Operations
Processing Streaming Data Using Join Operations
Description

Flink is a stateful, fault-tolerant, and large-scale system that works with bounded and unbounded datasets using the same underlying stream-first architecture. In this course, Processing Streaming Data Using Apache Flink, you will integrate your Flink applications with real-time Twitter feeds to perform analysis on high-velocity streams.

First, you’ll see how you can set up a standalone Flink cluster using virtual machines on a cloud platform. Next, you will install and work with the Apache Kafka reliable messaging service.
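Once the cluster is up, Flink applications are packaged as JARs and submitted to the JobManager, for example with the flink run command-line client or from the web dashboard. Below is a minimal sketch, in Java, of the kind of streaming job you might submit; the socket source, port, and job name are placeholder assumptions rather than the course's actual code.

import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class UppercaseJob {
    public static void main(String[] args) throws Exception {
        // On a cluster, this picks up the JobManager configured for the deployment.
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // Placeholder source: read lines of text from a local socket.
        DataStream<String> lines = env.socketTextStream("localhost", 9999);

        // A trivial transformation so the job has something to execute and print.
        lines.map(line -> line.toUpperCase()).print();

        // The job name shows up in the Flink web dashboard while the job runs.
        env.execute("Uppercase demo job");
    }
}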

Finally, you will perform a number of transformation operations on Twitter streams, including windowing and join operations.
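As a flavor of what such a windowing operation can look like in Flink's DataStream API, here is a hedged sketch in Java; the hashtag-counting scenario, the countPerHashtag method, and the 30-second window size are illustrative assumptions, not the course's exact code.

import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.windowing.assigners.TumblingProcessingTimeWindows;
import org.apache.flink.streaming.api.windowing.time.Time;

public class HashtagCounts {
    // Counts occurrences per hashtag over 30-second processing-time windows.
    // Input elements are assumed to be (hashtag, 1) pairs extracted upstream.
    public static DataStream<Tuple2<String, Integer>> countPerHashtag(
            DataStream<Tuple2<String, Integer>> taggedTweets) {
        return taggedTweets
                .keyBy(value -> value.f0)                                    // key by hashtag
                .window(TumblingProcessingTimeWindows.of(Time.seconds(30))) // 30s tumbling windows
                .sum(1);                                                     // sum the count field per window
    }
}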

When you are finished with this course, you will have the skills and knowledge to work with high-volume and high-velocity data using Flink, and to integrate it with Apache Kafka to process streaming data.
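For reference, the Flink-Kafka integration the course describes typically reads from a topic through the flink-connector-kafka dependency; the sketch below assumes a FlinkKafkaConsumer, a topic named "tweets", and a local broker, all placeholders rather than the course's exact setup.

import java.util.Properties;
import org.apache.flink.api.common.serialization.SimpleStringSchema;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.connectors.kafka.FlinkKafkaConsumer;

public class KafkaToFlink {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // Where the Kafka brokers live and which consumer group this job joins.
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", "localhost:9092");
        props.setProperty("group.id", "flink-tweet-reader");

        // Each Kafka record's value is deserialized as a plain UTF-8 string.
        FlinkKafkaConsumer<String> consumer =
                new FlinkKafkaConsumer<>("tweets", new SimpleStringSchema(), props);

        DataStream<String> tweets = env.addSource(consumer);
        tweets.print();  // windowing and join operators would be chained here instead

        env.execute("Read tweets from Kafka");
    }
}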

About the author

A problem solver at heart, Janani has a master's degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.

More from the author
Summarizing Data and Deducing Probabilities (Intermediate, 2h 50m, Jul 8, 2021)
More courses by Janani Ravi
Section Introduction Transcripts

Course Overview
Hi. My name is Janani Ravi, and welcome to this course on Processing Streaming Data Using Apache Flink. A little about myself: I have a master's degree in electrical engineering from Stanford and have worked at companies such as Microsoft, Google, and Flipkart. At Google, I was one of the first engineers working on real-time collaborative editing in Google Docs, and I hold four patents for its underlying technologies. I currently work on my own startup, Loonycorn, a studio for high-quality video content.

Flink is a stateful, fault-tolerant, and large-scale system that works with bounded and unbounded datasets using the same underlying stream-first architecture. In this course, you will integrate your Flink applications with real-time Twitter feeds to perform analysis on high-velocity streams.

First, you'll see how you can set up a standalone Flink cluster using virtual machines on a cloud platform. You will configure password-less SSH to allow cluster machines to communicate, and install and set up Flink on every machine. You will then run and monitor Flink streaming applications on this cluster.

Next, you will install and work with the Apache Kafka reliable messaging service. You'll understand how Kafka publishers, consumers, and topics work, and you'll integrate your Flink streaming application to read and write data to topics in Kafka. You'll also set up a Twitter developer account, which you will use to stream Twitter messages to a Kafka topic, which can then be processed by a streaming application.

Finally, you'll perform a number of transformation operations on Twitter streams, including windowing and join operations. We'll round this course off by exploring how you can perform unit testing and end-to-end testing of your Flink pipelines.

When you're finished with this course, you will have the skills and knowledge to work with high-volume and high-velocity data using Apache Flink and integrate with Apache Kafka to process streaming data.
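The transcript also mentions unit testing of Flink pipelines. As a minimal sketch of that idea, assuming a JUnit 4 setup, a stateless user-defined function can be exercised directly without starting a cluster; the HashtagExtractor function and the sample tweet below are hypothetical, not the course's code.

import static org.junit.Assert.assertEquals;

import java.util.ArrayList;
import java.util.List;
import org.apache.flink.api.common.functions.FlatMapFunction;
import org.apache.flink.util.Collector;
import org.junit.Test;

public class HashtagExtractorTest {

    // Hypothetical function under test: emits every #hashtag found in a tweet.
    static class HashtagExtractor implements FlatMapFunction<String, String> {
        @Override
        public void flatMap(String tweet, Collector<String> out) {
            for (String word : tweet.split("\\s+")) {
                if (word.startsWith("#")) {
                    out.collect(word);
                }
            }
        }
    }

    @Test
    public void emitsEachHashtag() throws Exception {
        List<String> emitted = new ArrayList<>();

        // A throwaway Collector that records emitted values instead of forwarding them.
        Collector<String> collector = new Collector<String>() {
            @Override
            public void collect(String record) { emitted.add(record); }
            @Override
            public void close() { }
        };

        new HashtagExtractor().flatMap("learning #flink with #kafka", collector);

        assertEquals(List.of("#flink", "#kafka"), emitted);
    }
}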