The DP-750 course prepares participants to design, implement, and maintain data engineering solutions on Azure Databricks. It covers configuring Databricks environments and Unity Catalog, ingesting and transforming data using SQL and Python, building and deploying data pipelines, enforcing data quality and governance, and monitoring and optimizing workloads. Participants will also learn software development lifecycle (SDLC) practices including Git-based version control and CI/CD with Databricks Asset Bundles.
Prerequisites:
In order to succeed in this course, you will need:
- Proficiency in SQL and Python for data ingestion and transformation
- Experience with software development lifecycle (SDLC) practices, including Git
- Familiarity with Microsoft Entra, Azure Data Factory, and Azure Monitor
- Experience with data integration, modeling, and pipeline development
- Understanding of data quality and governance concepts
Purpose
| Learn to design, implement, and maintain data engineering solutions on Azure Databricks while preparing for the DP-750 certification exam |
Audience
| Technical professionals working alongside administrators, data scientists, and data analysts to deploy and secure Databricks-based data solutions |
Role
| Data Engineers |Â Analytics Engineers |Â Platform/Solution ArchitectsÂ
|
Skill level
| Intermediate |
Style
| Lecture | Hands-on Activities | Labs |
Duration
| 4 days |
Related technologies
| Cloud | Python | SQL | Git |Â |
Â
Learning objectives
- Secure Unity Catalog objects using access controls, row/column-level security, managed identities, and service principals
- Design and implement data models including partitioning, clustering, slowly changing dimensions, and temporal tables
- Ingest data using Lakeflow Connect, notebooks, SQL methods, CDC feeds, Spark Structured Streaming, and Azure Event Hubs
- Cleanse, transform, and load data by profiling, resolving data quality issues, and applying joins, aggregations, and merge operations
- Design, build, and schedule data pipelines using Lakeflow Jobs and Lakeflow Spark Declarative Pipelines
- Apply SDLC best practices including Git version control, branching strategies, testing, and deployment via Databricks CLI and REST APIs
- Monitor, troubleshoot, and optimize workloads using Spark UI, DAGs, Delta table commands, and Azure Monitor