Web Scraping: Python Data Playbook
Learn how to tell a compelling graphical data story in a Jupyter Notebook with Seaborn having scraped information from a static web page with BeautifulSoup4 when no API is available.
What you'll learn
Scrape data from a static web page with BeautifulSoup4 and turn it into a compelling graphical data story in a Jupyter Notebook. In this course, Web Scraping: The Python Data Playbook, you will gain the ability to scrape data and present it graphically. First, you will learn to scrape using the requests module and BeautifulSoup4. Next, you will discover how to write a trustworthy scraping module backed by a unit test. Finally, you will explore how to turn the columns of data in a graphical story that will change the opinions of your colleagues. When you're finished with this course, you will have the skills and knowledge of web scraping needed to create a graphically compelling Jupyter Notebook without the use of an API.
Table of contents
- A Primer on HTML and CSS 4m
- Understanding the HTML, CSS and Structure of Our Target Page 4m
- Coming up with a Strategy for a More Complicated Web Page 3m
- Using BeautifulSoup4 to Navigate Our Scraped Data 5m
- Extracting Information from a Scraped Division 3m
- Using Selectors as an Alternative to the Find Method 3m
- Advice and Strategy for Scraping 2m
- Building the Scraper Module Using PyCharm 5m
- Dealing with Missing Data during the Scrape 2m
- Refactoring Our Code and Caching Our Scraped Data 2m
- Adding a Test to Verify Our Regular Expression Processing 6m
- Exporting Scraped Data to a CSV File 2m
- Getting a Data Overview with Pandas 3m
- Exploratory Data Analysis Strategy 4m
- Reviewing Our Hypothesis 4m
- Investigating Relationships between MPG and Weight 3m
- Understanding How Cylinders and Displacement Are Related 2m
- Looking at MPG over the Years 2m
- Understanding Brands and Territories with Text Processing 2m
- Telling a Data Story to Explain Our Discoveries 3m