Featured resource
2025 Tech Upskilling Playbook
Tech Upskilling Playbook

Build future-ready tech teams and hit key business milestones with seven proven plays from industry leaders.

Check it out
  • Lab
    • Libraries: If you want this lab, consider one of these libraries.
    • Cloud
    • Data
Azure icon
Labs

Work with GitHub in Azure Data Factory

Azure Data Factory and Synapse Pipelines allow you to connect and configure a Git repository, where all the artifacts can be stored. In this lab, we'll create a new source control repository in GitHub. We'll then configure GitHub with Azure Data Factory by defining collaboration and publish branches. Then, we'll see what changes can be committed to these branches, and how it generates an ARM template for deployment.

Azure icon
Lab platform
Lab Info
Level
Intermediate
Last updated
Sep 22, 2025
Duration
30m

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
Table of Contents
  1. Challenge

    Set Up the Azure Environment
    1. Create an Azure Data Lake Gen2 account.
    2. Create a container called taxidata in the Data Lake account.
    3. Upload the TaxiRides.csv file to the container.

      Note: Make sure you have downloaded the TaxiRides.csv file that is located in this GitHub repository.

    4. Create an Azure Data Factory instance.
  2. Challenge

    Set Up a New Repository in GitHub
    1. Log in to an existing GitHub account, or create a new GitHub account if necessary. The link to the GitHub site is available in the Additional Information and Resources section.
    2. Create a new public repository.
  3. Challenge

    Configure GitHub in Azure Data Factory
    1. Authenticate to GitHub from Data Factory, and configure the repository.
    2. Define collaboration and publish branches.
    3. Select the collaboration branch as the working branch.
  4. Challenge

    Save Changes to Collaboration Branch
    1. Create a pipeline with Copy activity. Do not fill all the properties so that the pipeline remains invalid.
    2. Save the pipeline with invalid changes, and verify them in the collaboration branch of the GitHub repository.
    3. Complete the pipeline by copying a file from one data lake folder to another. Create a linked service to the data lake and two datasets (one for the source file and the other for the sink file).
    4. Save the pipeline with valid changes, and verify them in the collaboration branch of the GitHub repository.
  5. Challenge

    Publish the Changes to Publish Branch
    1. Publish the pipeline with valid changes.
    2. Verify that an ARM (Azure Resource Manager) template has been generated and stored in the publish branch.
About the author

Pluralsight Skills gives leaders confidence they have the skills needed to execute technology strategy. Technology teams can benchmark expertise across roles, speed up release cycles and build reliable, secure products. By leveraging our expert content, skill assessments and one-of-a-kind analytics, keep up with the pace of change, put the right people on the right projects and boost productivity. It's the most effective path to developing tech skills at scale.

Real skill practice before real-world application

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Learn by doing

Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.

Follow your guide

All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.

Turn time into mastery

On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.

Get started with Pluralsight