Course

Skills

Architecting Data Warehousing Solutions Using Google BigQuery

by Janani Ravi

BigQuery is the Google Cloud Platform’s data warehouse on the cloud. In this course, you’ll learn how you can work with BigQuery on huge datasets with little to no administrative overhead.

Preview this course

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Rating

(38)

Level

Beginner

Updated

Jun 20, 2024

Duration

2h 48m

What you'll learn

Organizations store massive amounts of data that gets collated from a wide variety of sources. BigQuery supports fast querying at a petabyte scale, with serverless functionality and autoscaling. BigQuery also supports streaming data, works with visualization tools, and interacts seamlessly with Python scripts running from Datalab notebooks.

In this course, Architecting Data Warehousing Solutions Using Google BigQuery, you’ll learn how you can work with BigQuery on huge datasets with little to no administrative overhead related to cluster and node provisioning.

First, you'll start off with an overview of the suite of storage products on the Google Cloud and the unique position that BigQuery holds. You’ll see how BigQuery compares with Cloud SQL, BigTable, and Datastore on the GCP and how it differs from Amazon Redshift, the data warehouse on AWS.

Next, you’ll create datasets in BigQuery which are the equivalent of databases in RDMBSes and create tables within datasets where actual data is stored. You’ll work with BigQuery using the web console as well as the command line. You’ll load data into BigQuery tables using the CSV, JSON, and AVRO format and see how you can execute and manage jobs.

Finally, you'll wrap up by exploring advanced analytical queries which use nested and repeated fields. You’ll run aggregate operations on your data and use advanced windowing functions as well. You’ll programmatically access BigQuery using client libraries in Python and visualize your data using Data Studio.

At the end of this course, you'll be comfortable working with huge datasets stored in BigQuery, executing analytical queries, performing analysis, and building charts and graphs for your reports.

Course Overview

2mins

Course Overview 2m

Understanding BigQuery in the GCP Service Taxonomy

42mins

Using Datasets, Tables, and Views in BigQuery

30mins

Module Overview 1m
Datasets Tables and Views 5m
Creating and Editing Access to a Dataset 5m
Verifying Access to Datasets 2m
Creating and Querying Tables 5m
Creating Tables from Other Source Tables 7m
Creating Views 2m
Authorized Views 4m

Getting Data in and out of BigQuery

27mins

Module Overview 1m
Uploading Data to GCS Buckets 3m
Importing Data from CSV Files on Cloud Storage 4m
Configuring Additional Settings While Importing JSON Files 3m
Creating External Tables with Data in GCS Buckets 4m
Partitioning in BigQuery 4m
Creating and Querying Ingestion Time Partitioned Tables 6m
Creating Column Based Partitioned Tables Using the Command Line 4m

Performing Advanced Analytical Queries in BigQuery

46mins

Module Overview 1m
Normalized Storage in a Traditional Database 3m
Denormalized Storage, Nested, and Repeated Fields 4m
The UNNEST, ARRAY_AGG, and the STRUCT Operators 3m
Working with Nested Fields 3m
Populating Data into a Table with Nested Fields Using STRUCT 4m
Working with Repeated Fields 3m
Populating Tables with Repeated Fields Using ARRAY_AGG 3m
Using Nested and Repeated Fields Together 3m
Using UNNEST to Query Repeated Fields 3m
Aggregations 4m
Subqueries 3m
Windowing Operations 4m
Performing Window Operations Using "Partition By" and "Order By" 4m
Windowing Operations Using a Window Range 3m

Programmatically Accessing BigQuery from Client Programs

19mins

Module Overview 1m
Integrating BigQuery with Data Studio 7m
Connecting to Datalab 3m
Running Queries Programmatically 6m
Summary and Further Study 1m

About the author

Janani Ravi

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ... more

See more courses by Janani Ravi

Try for free

Get this course plus top-rated picks in tech skills and other popular topics.

$29.00

per month after 10 day trial

Your 10 day Standard free trial includes

Expert-led courses

Keep up with the pace of change with thousands of expert-led, in-depth courses.

For teams

Give up to 50 users access to our full library including this course free for 30 days

Course info

Rating

(38)

Level

Beginner

Updated

Jun 20, 2024

Duration

2h 48m

Ready to upskill? Get started

Contact Sales

Architecting Data Warehousing Solutions Using Google BigQuery

What you'll learn

Table of contents

About the author

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill up
your entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Contact Sales

Architecting Data Warehousing Solutions Using Google BigQuery

What you'll learn

Table of contents

About the author

Get access now

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Support

Community

Company

Industries

Newsletter

Ready to skill up
your entire team?

Ready to skill up
your entire team?