Building Data Pipelines with Luigi 3 and Python

by Dan Tofan

Other developers implement data pipelines by putting together a bunch of hacky scripts that, over time, turn into liabilities and maintenance nightmares. Take this course to implement sane and smart data pipelines with Luigi in Python.

Course info

Rating

(20)

Level

Intermediate

Updated

Jun 17, 2024

Duration

1h 34m

What you'll learn

Data arrives from various sources and needs further processing. It is tempting to reinvent the wheel and write your own library for building batch-processing data pipelines, but the result is usually pipelines that are difficult to maintain. In this course, Building Data Pipelines with Luigi 3 and Python, you'll learn how to build data pipelines with Luigi instead. First, you'll explore how to build your first data pipelines with Luigi. Next, you'll discover how to configure Luigi pipelines. Finally, you'll learn how to run Luigi pipelines. When you're finished with this course, you'll have the Luigi skills and knowledge for building data pipelines that are easy to maintain.

Course Overview

1min

Course Overview 2m

Getting Started with Luigi

25mins

Building Luigi Pipelines

26mins

Build a Pipeline with Three Successive Tasks 6m
Build a Pipeline with Three Parallel Tasks 6m
Build a Pipeline with Parallel and Successive Tasks 6m
Build a Pipeline with Many Tasks 5m
Build a Pipeline with Dynamic Tasks 5m

Configuring Luigi Pipelines

17mins

Add Parameters to a Pipeline 5m
Start a Pipeline with Parameters 4m
Using Configuration Files 4m
More Types of Parameters 4m

Running Luigi Pipelines

23mins

Starting a Pipeline 8m
Scheduling a Pipeline 5m
Tracking Pipeline Progress 5m
Luigi Alternatives 2m
Summary 3m

Course FAQ

What is a data pipeline?

A data pipeline is a series of data processing steps. Data pipelines consist of three components: a source, a processing step or steps, and a destination.
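The three components can be sketched in plain Python. This is only an illustration, not material from the course: the function names (`source`, `clean`, `parse`, `destination`) and the sample records are made up for the example, and a real pipeline would read from files or APIs and write to a database or file.

```python
from typing import Iterable, Iterator


def source() -> Iterator[str]:
    """Source: yield raw records (hard-coded here for illustration)."""
    yield from [" alice,30 ", "bob,25", "", "carol,41"]


def clean(records: Iterable[str]) -> Iterator[str]:
    """Processing step 1: strip whitespace and drop empty records."""
    for record in records:
        record = record.strip()
        if record:
            yield record


def parse(records: Iterable[str]) -> Iterator[dict]:
    """Processing step 2: turn each 'name,age' record into a dictionary."""
    for record in records:
        name, age = record.split(",")
        yield {"name": name, "age": int(age)}


def destination(rows: Iterable[dict]) -> list:
    """Destination: collect the rows in a list (stand-in for a real sink)."""
    return list(rows)


# Chain the stages: source -> processing steps -> destination.
result = destination(parse(clean(source())))
print(result)
```

Because each stage is a generator, records flow through one at a time, and a stage can be swapped out without touching the others.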

What prerequisites are needed for this course?

Prerequisites for this course are fluency in Python and familiarity with the Linux command line.

What is Luigi in Python?

Luigi is a Python package that helps you build complex pipelines of data-intensive jobs. It handles dependency resolution, workflow management, visualization, failure handling, and command-line integration.
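To make "dependency resolution" concrete, here is a toy sketch that uses only the standard library. It is not Luigi's API or implementation: `ToyTask` and `build` are hypothetical names invented for this example. It only mimics the general idea that a task declares what it requires, counts as complete once its output exists, and runs after its upstream tasks.

```python
import os
import tempfile


class ToyTask:
    """Hypothetical stand-in for a pipeline task; not Luigi's API."""

    def __init__(self, name, workdir, requires=()):
        self.name = name
        self.path = os.path.join(workdir, name + ".txt")
        self.requires = list(requires)

    def complete(self):
        # A task counts as done when its output file exists.
        return os.path.exists(self.path)

    def run(self):
        # Concatenate upstream outputs, then append this task's name.
        parts = []
        for dep in self.requires:
            with open(dep.path) as f:
                parts.append(f.read())
        with open(self.path, "w") as f:
            f.write("".join(parts) + self.name + "\n")


def build(task, executed):
    """Run upstream tasks first; skip any task whose output already exists."""
    if task.complete():
        return
    for dep in task.requires:
        build(dep, executed)
    task.run()
    executed.append(task.name)


workdir = tempfile.mkdtemp()
extract = ToyTask("extract", workdir)
transform = ToyTask("transform", workdir, requires=[extract])
load = ToyTask("load", workdir, requires=[transform])

first_run = []
build(load, first_run)   # runs all three tasks in dependency order

second_run = []
build(load, second_run)  # nothing to do: the final output already exists

print(first_run)
print(second_run)
```

Asking only for the final task is enough: the resolver walks back through the requirements, and a second invocation does no work because the final output is already on disk.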

What are the benefits of Python?

Python is easy to read, learn, and write. It is also open source, portable, and dynamically typed, and it offers extensive support libraries.

What are data pipelines used for?

Data pipelines are primarily used to automate the process of extracting, transforming, and loading data.

About the author

Dan Tofan

Dan started programming decades ago on a Spectrum clone and started his professional programming career in 2003. Eager to learn, Dan moved to the Netherlands to study at the University of Groningen. Now, Dan is proud of his PhD thesis on decision making and knowledge acquisition in software architecture, and about a dozen publications with hundreds of citations. Dan used Microsoft technologies for many years, but migrated gradually to Python, Linux and AWS, to learn more of the computing world.
