- Lab
-
Libraries: If you want this lab, consider one of these libraries.
- Cloud
- Data

Querying Data in Amazon S3 with Amazon Athena
Welcome to this hands-on AWS lab, where we'll query data stored in Amazon S3 with SQL queries in Amazon Athena. Let's get started!

Lab Info
Table of Contents
-
Challenge
Create a Table from S3 Bucket Metadata
- Navigate to Amazon Athena.
- Configure settings to send query results to the S3 bucket and enable
Assign bucket owner full control over query results
. - Create a table from the S3 bucket data using the following values:
- Database: aws_service_logs
- Table Name: cf_access_optimized
- Location of Input Data Set: s3://<S3_BUCKET_NAME>/
- Data Format: Parquet
- Bulk add columns using this data:
time timestamp, location string, bytes bigint, requestip string, method string, host string, uri string, status int, referrer string, useragent string, querystring string, cookie string, resulttype string, requestid string, hostheader string, requestprotocol string, requestbytes bigint, timetaken double, xforwardedfor string, sslprotocol string, sslcipher string, responseresulttype string, httpversion string
- Create the following partitions:
- Column Name: year
- Column Name: month
- Column Name: day
- Click Create table.
-
Challenge
Add Partition Metadata
-
Open a new query tab.
-
Run the following query:
MSCK REPAIR TABLE aws_service_logs.cf_access_optimized
-
Observe that the
row count
equals207535
with the following query:SELECT count(*) AS rowcount FROM aws_service_logs.cf_access_optimized
-
Verify the partitions were created with the following query:
SELECT * FROM aws_service_logs.cf_access_optimized order by time desc LIMIT 10
-
-
Challenge
Query the Total Bytes Served in a Date Range
-
Observe the
bytes
column from the following query:SELECT * FROM aws_service_logs.cf_access_optimized WHERE time BETWEEN TIMESTAMP '2018-11-02' AND TIMESTAMP '2018-11-03'
-
Run the following query:
SELECT SUM(bytes) AS total_bytes FROM aws_service_logs.cf_access_optimized WHERE time BETWEEN TIMESTAMP '2018-11-02' AND TIMESTAMP '2018-11-03'
-
Observe the value for
total_bytes
equals87310409
.
-
About the author
Real skill practice before real-world application
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Learn by doing
Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.
Follow your guide
All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.
Turn time into mastery
On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.