Featured resource
Tech Upskilling Playbook 2025
Tech Upskilling Playbook

Build future-ready tech teams and hit key business milestones with seven proven plays from industry leaders.

Learn more
  • Labs icon Lab
  • Cloud
Google Cloud Platform icon
Labs

Joining Datasets with KSQL

KSQL provides a SQL-like interface for most of the operations you perform using Kafka Streams. Like Kafka Streams, KSQL is capable of joining multiple streams into a single dataset. In this lab, we will work with joins in KSQL by writing a persistent streaming query that joins two streams. This will give you some hands-on experience with joins in KSQL.

Google Cloud Platform icon
Labs

Path Info

Level
Clock icon Intermediate
Duration
Clock icon 30m
Last updated
Clock icon Aug 12, 2025

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Table of Contents

  1. Challenge

    Create Streams for Both Input Topics

    1. Start a KSQL session:

      sudo ksql
      
    2. Set auto.offset.reset to earliest so that all streams will process the existing test data:

      SET 'auto.offset.reset' = 'earliest';
      
    3. View the data in the member_signups topic:

      PRINT 'member_signups' FROM BEGINNING;
      
    4. Create a stream for the member_signups topic:

      CREATE STREAM member_signups
        (lastname VARCHAR,
          firstname VARCHAR)
        WITH (KAFKA_TOPIC='member_signups',
          VALUE_FORMAT='DELIMITED');
      
    5. View the data in the member_contact topic:

      PRINT 'member_contact' FROM BEGINNING;
      
    6. Create a stream for the member_contact topic:

      CREATE STREAM member_contact
        (email VARCHAR)
        WITH (KAFKA_TOPIC='member_contact',
          VALUE_FORMAT='DELIMITED');
      
  2. Challenge

    Create a Persistent Streaming Query to Join the Two Streams and Output the Result

    1. Create a persistent streaming query to join the two streams:

      CREATE STREAM member_email_list AS
        SELECT member_signups.firstname, member_signups.lastname, member_contact.email
        FROM member_signups
        INNER JOIN member_contact WITHIN 365 DAYS ON member_signups.rowkey = member_contact.rowkey;
      
    2. Check the output topic to verify the correct data is present:

      PRINT 'MEMBER_EMAIL_LIST' FROM BEGINNING;
      

Pluralsight Skills gives leaders confidence they have the skills needed to execute technology strategy. Technology teams can benchmark expertise across roles, speed up release cycles and build reliable, secure products. By leveraging our expert content, skill assessments and one-of-a-kind analytics, keep up with the pace of change, put the right people on the right projects and boost productivity. It's the most effective path to developing tech skills at scale.

What's a lab?

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Did you know?

On average, you retain 75% more of your learning if you get time for practice.