Course

Improving Azure Data Lake Performance

Running queries in Azure Data Lake? Are your queries costing too much? This course will help you learn how to take control of your Data Lake. Grab those pesky queries by the scruff of the neck and improve Azure Data Lake performance!

Advanced

1h 39m

(19)

Created by Mike McQuillan

Last Updated Jul 08, 2024

This course is included in the libraries shown below:

Data

What you'll learn

OK, so you are using Azure Data Lake, and you think it's great. You just wish you could improve the performance of your U-SQL queries. Why does that query always read your entire data set? Why does this query take forever to complete? Like anything else in the Big Data world, your Azure Data Lake has to be structured around your data. This course, Improving Azure Data Lake Performance, will show you how to put the right structure in place. Then watch the magic start to happen!

First, you'll see how an Azure Data Lake works behind the scenes: how it handles different types of data and how the storage of that data can be optimized. Next, you'll see how it's possible to optimize non-structured data. Finally, you'll be shown how structuring your data opens up a world of possibilities, including horizontal and vertical partitioning. This is where the real power of the Azure Data Lake comes to light! Horizontal partitioning allows you to defer a lot of control to the Data Lake, whereas vertical partitioning allows you, the developer, to take total control of how your data is partitioned and distributed within the Data Lake.

When you're finished with this course, you'll understand how you can better optimize your jobs and save some cash.

Software required: Visual Studio Community Edition 2017 with the Azure Data Lake and Stream Analytics Tools installed.

Table of contents

Course Overview | 2m 24s

Why Bother Organizing an Azure Data Lake? | 45m

Dividing and Conquering an Azure Data Lake | 27m

Kicking the Bucket – Manually Dividing an Azure Data Lake | 24m

About the author

Mike McQuillan

16 courses

4.5 author rating

610 ratings

Mike loves to mess around with data and programming problems, the bigger the better. He’s worked with a variety of companies, helping to build and improve systems of all shapes and sizes.
