Course

Extracting Data from HTML with BeautifulSoup

This course covers the important aspects of scraping websites using Beautiful Soup. You will learn to build, manipulate and traverse the parse tree, as well as to leverage advanced features such as working with filters, CSS and XPath.

Intermediate

2h 26m

(22)

Created by Janani Ravi

Last Updated Feb 05, 2020

Get started today

Access this course and other top-rated tech content with one of our business plans.

Start a free team trial

Buy now

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

Start a free trial

Buy now

This course is included in the libraries shown below:

Data

Course

Extracting Data from HTML with BeautifulSoup

Intermediate

2h 26m

(22)

Created by Janani Ravi

Last Updated Feb 05, 2020

Get started today

Access this course and other top-rated tech content with one of our business plans.

Start a free team trial

Buy now

Try this course for free

Access this course and other top-rated tech content with one of our individual plans.

Start a free trial

Buy now

This course is included in the libraries shown below:

Data

What you'll learn

Web scraping is an important technique that is widely used as the first step in many workflows in data mining, information retrieval, and text-based machine learning.

In this course, Extracting Data from HTML with BeautifulSoup* you will gain the ability to build robust, maintainable web scraping solutions using the Beautiful Soup library in Python.

First, you will learn how regular expressions can be used to scrape web content, and how Beautiful Soup does better in important ways. Next, you will discover how Beautiful Soup parses HTML from web content, fixes up badly-formed tags, and builds a clean, easily traversable parse tree. You will then see how that parse tree can be used in order to find and retrieve specific patterns.

Finally, you will round out your knowledge by leveraging advanced features of beautiful soup such as working with CSS and XPath. When you’re finished with this course, you will have the skills and knowledge to implement robust web scraping using Beautiful Soup.

Extracting Data from HTML with BeautifulSoup

Intermediate

2h 26m

(22)

Table of contents

Course Overview | 1m 33s

About the author

Janani Ravi

200 courses

4.5 author rating

6281 ratings

A problem solver at heart, Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.

More Courses by Janani

Extracting Data from HTML with BeautifulSoup

Extracting Data from HTML with BeautifulSoup

Get started today

Try this course for free

Extracting Data from HTML with BeautifulSoup

What you'll learn

Extracting Data from HTML with BeautifulSoup

Course Overview 1m

Getting Started with BeautifulSoup 45m

Navigating the Parse Tree 40m

Searching for Elements in the Parse Tree 29m

Leveraging Advanced Features of BeautifulSoup 28m

2025 Forrester Wave™ names Pluralsight as a Leader among tech skills dev platforms