Skip to content

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
  • Labs icon Lab
  • A Cloud Guru
Google Cloud Platform icon

Replicating Data Between Two Kafka Clusters

Kafka can be deployed in mutliple data centers in an "Active-Active", "Active-Passive", or centralized architecture. In this hands-on lab, we simulate an active-passive architecture in which data from one cluster is replicated to another cluster using a tool called Replicator. Replicator, like MirrorMaker, allows you to preserve topic configuration in the source cluster while replicating the messages from one cluster to another.

Google Cloud Platform icon

Path Info

Clock icon Intermediate
Clock icon 1h 45m
Clock icon Jul 17, 2019

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Table of Contents

  1. Challenge

    Start the Destination Cluster

    Note: After connecting via ssh, please give the lab extra time to finish provisioning. If you get a dpkg frontend lock message when running sudo apt install -y default-jdk, please give the lab more time before retrying the command.

    Start Zookeeper.

    bin/zookeeper-server-start etc/kafka/

    Start Kafka.

    bin/kafka-server-start etc/kafka/
  2. Challenge

    Start the Origin Cluster

    Make a copy of the configuration files.

    cp etc/kafka/ /tmp/
    cp etc/kafka/ /tmp/

    Change the port numbers for the origin cluster.

    sed -i -e "s/2181/2171/g" /tmp/
    sed -i -e "s/9092/9082/g" /tmp/
    sed -i -e "s/2181/2171/g" /tmp/
    sed -i -e "s/#listen/listen/g" /tmp/

    Change the data directory for the origin cluster.

    sed -i -e "s/zookeeper/zookeeper_origin/g" /tmp/
    sed -i -e "s/kafka-logs/kafka-logs-origin/g" /tmp/

    Start Zookeeper.

    bin/zookeeper-server-start /tmp/

    Start Kafka.

    bin/kafka-server-start /tmp/
  3. Challenge

    Create a Topic in the Source Cluster

    Create a topic named test-topic with 1 partition and a replication factor of 1.

    bin/kafka-topics --create --topic test-topic --replication-factor 1 --partitions 1 --zookeeper localhost:2171
  4. Challenge

    Run the Replicator Tool

    Run Kafka Connect in Standalone Mode and pass in the file

    bin/connect-standalone etc/kafka/ etc/kafka-connect-replicator/
  5. Challenge

    Produce and Verify Replication

    Verify that the topic was replicated to the destination cluster.

    bin/kafka-topics --describe --topic test-topic.replica --zookeeper localhost:2181

    Open a Console Producer and write to the topic test-topic in the source cluster.

    seq 10000 | bin/kafka-console-producer --topic test-topic --broker-list localhost:9082

    Confirm the messages were replicated by opening a console consumer to the destination cluster.

    bin/kafka-console-consumer --from-beginning --topic test-topic.replica --bootstrap-server localhost:9092

The Cloud Content team comprises subject matter experts hyper focused on services offered by the leading cloud vendors (AWS, GCP, and Azure), as well as cloud-related technologies such as Linux and DevOps. The team is thrilled to share their knowledge to help you build modern tech solutions from the ground up, secure and optimize your environments, and so much more!

What's a lab?

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Did you know?

On average, you retain 75% more of your learning if you get time for practice.

Start learning by doing today

View Plans