Data Wrangling with Azure
In this lab, you’ll practice cleansing data with Azure Data Factory and Azure DataFlow scripts. When you’re finished with this lab, you’ll be able to build a Data Wrangling pipeline in Azure.
Create an Azure Data Factory Pipeline for Data Wrangling
You will create an Azure Data Factory Pipeline for Data Cleansing and configure a Debug session.
Configure an Azure Blob as Pipeline Source
You will configure a file in an Azure Blob Storage container as the source for the Pipeline.
Delete Duplicate Rows in Data flow
You will remove duplicate rows in the Data flow.
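To make the idea of duplicate removal concrete outside the Data Factory designer, here is a minimal Python sketch. It is not the Data flow implementation used in the lab; it only illustrates the logic of keeping the first occurrence of each identical row. The function name `drop_duplicate_rows` and the `title` and `year` columns are hypothetical, chosen for illustration.

```python
def drop_duplicate_rows(rows):
    """Return rows with exact duplicates removed, keeping the first occurrence."""
    seen = set()
    unique = []
    for row in rows:
        # Build a hashable key from all column/value pairs of the row.
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample data; the lab's real file may use different columns.
movies = [
    {"title": "Alien", "year": 1979},
    {"title": "Heat", "year": 1995},
    {"title": "Alien", "year": 1979},
]

deduped = drop_duplicate_rows(movies)
print(deduped)
```

The sketch compares every column of a row. To treat rows as duplicates based on a subset of columns, the key would be built from those columns only.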
Split Pipeline Data Based on Null Conditions
You will finish the cleansing Pipeline by checking for NULL values, so that only clean data is loaded into Azure CosmosDB.
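The null-based split can be pictured as routing each row to one of two outputs. The Python sketch below illustrates that logic only and is not the Data flow script from the lab. The function name `split_on_nulls`, the column names, and the choice to treat an empty string as NULL are all assumptions made for this example.

```python
def split_on_nulls(rows, required_columns):
    """Split rows into (clean, rejected) based on missing required values."""
    clean, rejected = [], []
    for row in rows:
        # Assumption: a missing key, None, or an empty string counts as NULL.
        has_null = any(row.get(col) in (None, "") for col in required_columns)
        if has_null:
            rejected.append(row)
        else:
            clean.append(row)
    return clean, rejected


# Hypothetical sample data; the lab's real file may use different columns.
movies = [
    {"title": "Alien", "year": 1979},
    {"title": None, "year": 1995},
    {"title": "Heat", "year": ""},
]

clean, rejected = split_on_nulls(movies, ["title", "year"])
print(len(clean), "clean rows;", len(rejected), "rejected rows")
```

In this sketch only the `clean` list would go on to the database sink, while `rejected` could be logged or stored separately for inspection.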
Run the Azure Data Factory Pipeline and Validate in CosmosDB
You will trigger the cleansing Pipeline and verify in CosmosDB that the movies were inserted correctly.
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.
- Azure Blob Storage
- Azure Data Factory
- Azure DataFlow (optional)