Skip to content
Pluralsight Logo
  • Pluralsight
  • Skills
  • Flow
  • Blog
    • Sign in to

      Sign in to Pluralsight Skills
    • Sign in to

      Sign in to Pluralsight Flow
Logo for Pluralsight Skills
Pluralsight Logo
  • Sign in
  • Menu
  • Sign in to Pluralsight Skills

    Sign in to Skills

  • Sign in to Pluralsight Flow

    Sign in to Flow

  • Pluralsight
  • Skills
  • Flow
  • Blog
  • Why Skills?
    • Ways to upskill
    • Icon for Courses Courses
    • Icon for Skills assessments Skill assessments
    • Icon for Labs Labs
    • Icon for Hands-on learning Hands-on learning
    • Icon for Certification prep Certification prep
    • See our entire course library
    • Skills for
    • Icon for Software development Software development
    • Icon for IT ops IT Ops
    • Icon for Info & cyber security Info & cybersecurity
    • Icon for Cloud computing Cloud computing
    • Icon for Machine learning Machine learning
    • Icon for Data professional Data professional
    • For individuals
    • Top trending paths
    • Microsoft Azure Deployment
      Microsoft Azure
    • Cleaning Data with R
      Cleaning data with R
    • Ruby Language Fundamentals
      Ruby Language Fundamentals
    • AWS Operations
      AWS Operations
    • Core Python
      Core Python
  • View plans
  • For individuals
  • Contact Sales
  • Try for free

Contact Sales

1020 Redirect Link
Thank you!
Our team will be in touch shortly.

Loading form...

If this message remains, it may be due to cookies being disabled or to an ad blocker.

Close button
×
AWS Labs
Labs badge iconLAB

Implement a Data Ingestion Solution Using AWS Glue

In this lab, you'll practice ingesting semi-structured JSON sample data into a normalized AWS Glue Data Catalog from a source S3 data store. When you're finished, you'll have configured AWS Glue to continuously crawl S3 for new data every 12 hours.

* Our Labs are Available for Enterprise and Professional plans only.
Terms and conditions apply.
Contact sales
Labs - Implement a Data Ingestion Solution Using AWS Glue

Lab info

Rating (256)
Level
Beginner
Duration
50m
Released
Aug 26, 2021

Lab author

Lab Author: Steve Mueller
Steve Mueller
Steve Mueller has spent the past 25 years providing in-person technical expertise to more than 300 of the largest enterprises across 5 continents. Steve spent his formative years with Java application servers while at BEA (now Oracle), where he built core devops and virtualization principles to achieve automation and scale. He has worked for VMware and most recently AWS, where he spent 7 years in a number of roles addressing customer needs. Steve is now the CTO of Hypersive, a startup focused on... more next-generation edge, end-user compute, and device virtualization. He has spoken at numerous industry tradeshows, including AWS re:Invent and Autodesk University, and is passionate about breaking difficult ideas down into easily understood concepts and educating others with that knowledge. He is an established AWS expert, loves anything serverless, believes API gateways are more transformative than we think, and gets energized when talking about containerization, IoT, and event bridges.
  • Table of contents
  • What's a lab?
  • Prerequisites
Challenge

Obtain Source Data Files

Download the JSON source data files from the official AWS Samples GitHub repository that will be imported into S3 and ingested by AWS Glue.

Challenge

Create an S3 Bucket

Provision an S3 bucket that will be used by AWS Glue as the primary data store for its Data Catalog.

Challenge

Upload Source Data Files to S3

Load the JSON source data files into the S3 data store.

Challenge

Create the Crawler

Provision a crawler in AWS Glue to populate the AWS Glue Data Catalog every 12 hours with tables from the source S3 data store.

Challenge

Manually Run the Crawler

Execute the crawler manually to verify it performs as expected, and populate the AWS Glue Data Catalog with tables from the source S3 data store.

Labs - code icon

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Labs - list icon

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Labs - analytic icon

Did you know?

On average, you retain 75% more of your learning if you get time for practice.

Recommended prerequisites

  • Amazon S3
  • AWS Glue

Start learning by doing today

Contact sales

Contact Sales

1020 Redirect Link
Thank you!
Our team will be in touch shortly.

Loading form...

If this message remains, it may be due to cookies being disabled or to an ad blocker.

Close button

Ready to skill up
your entire team?

10
Subscriptions
Need more subscriptions? Contact sales.
Continue to checkout Continue to checkout
Cancel

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

  • Access thousands of videos to develop critical skills
  • Give up to 10 users access to thousands of video courses
  • Practice and apply skills with interactive courses and projects
  • See skills, usage, and trend data for your teams
  • Prepare for certifications with industry-leading practice exams
  • Measure proficiency across skills and roles
  • Align learning to your goals with paths and channels

Ready to skill up
your entire team?

10
Subscriptions
Need more subscriptions? Contact sales.
Continue to checkout
Cancel

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

  • Access thousands of videos to develop critical skills
  • Give up to 10 users access to thousands of video courses
  • Practice and apply skills with interactive courses and projects
  • See skills, usage, and trend data for your teams
  • Prepare for certifications with industry-leading practice exams
  • Measure proficiency across skills and roles
  • Align learning to your goals with paths and channels
  • Support

    • Contact
    • Help Center
    • IP Allowlist
    • Site Map
    • Download Pluralsight
    • Skills Plans
    • Flow Plans
  • Community

    • Guides
    • Teach
    • Partner with Pluralsight
    • Affiliate Partners
    • Pluralsight One
    • Authors
  • Company

    • About Us
    • Careers
    • Newsroom
    • Resources
  • Industries

    • Public Sector
    • Non-Profit

Newsletter

Sign up with your email to join our mailing list.

1041 Redirect Link Form Submitted Successfully!

Loading form...

If this message remains, it may be due to cookies being disabled or to an ad blocker.

  • Facebook Logo Icon
  • Twitter Logo Icon
  • Instagram Logo Icon
  • LinkedIn Logo Icon
  • Youtube Logo Icon
Pluralsight Logo Copyright © 2004 - 2023 Pluralsight LLC. All rights reserved
  • Terms of Use
  • Privacy Notice
  • Modern Slavery Act Transparency Statement