Simple play icon Course
Skills

Crawling the Web with Python 3 and Scrapy 2

by Ed Freitas

Have you ever wanted to know how to programmatically crawl websites and extract data from them? If so, then this course is for you. You will learn how to use the Scrapy framework to write spiders that are able to extract valuable data from the web.

What you'll learn

Have you ever spent hours trying to gather high-quality data from specific websites, and wondered how you could extract this data programmatically and use it within your own applications? In this course, Crawling the Web with Python 3 and Scrapy 2, you will gain the ability to write spiders that can extract data from the web, using Python and Visual Studio Code, through an advanced yet easy-to-use framework called Scrapy. First, you will learn what scraping and crawling are, and explore all its implications. Next, you will discover how to scaffold a Scrapy project and write spiders. Finally, you will explore how to influence how spiders crawl websites and extract data in different formats. When you are finished with this course, you will have the skills and knowledge on how to use Scrapy with Python, to programmatically crawl and scrape data from any website.

About the author

Eduardo is a technology enthusiast, software architect and customer success advocate. He's designed enterprise .NET solutions that extract, validate and automate critical business processes such as Accounts Payable and Mailroom solutions for all types of organizations. He's designed and supported production systems for global names such as Coca Cola, Enel, Pirelli, Fiat-Chrysler, Xerox and many others. He's a well-known specialist in the Enterprise Content Management market segment, specifically... more

Ready to upskill? Get started