- Lab
-
Libraries: If you want this lab, consider one of these libraries.
- Cloud
- Data

Running a Spark Job on a Dataproc Cluster
Google Cloud Dataproc is a fully managed and highly scalable service capable of running Apache Spark as well as over 30 other open-source tools and frameworks. In this hands-on lab, you’ll provision a Dataproc cluster and then submit a Spark job that calculates and outputs the value of Pi to a high degree.

Lab Info
Table of Contents
-
Challenge
Enable the Necessary API
Enable the Dataproc API, either through the user interface or the Cloud Shell.
-
Challenge
Create a Dataproc Cluster
Provision a Dataproc cluster capable of running the Spark job.
-
Challenge
Submit a Spark Job
Submit the Spark job, and evaluate the results.
About the author
Real skill practice before real-world application
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Learn by doing
Engage hands-on with the tools and technologies you’re learning. You pick the skill, we provide the credentials and environment.
Follow your guide
All labs have detailed instructions and objectives, guiding you through the learning process and ensuring you understand every step.
Turn time into mastery
On average, you retain 75% more of your learning if you take time to practice. Hands-on labs set you up for success to make those skills stick.