A Cloud Guru
Running a Spark Job on a Dataproc Cluster
Google Cloud Dataproc is a fully managed, highly scalable service for running Apache Spark, along with over 30 other open-source tools and frameworks. In this hands-on lab, you’ll provision a Dataproc cluster and then submit a Spark job that calculates and outputs the value of Pi to a high degree of precision.
Table of Contents
Enable the Necessary API
Enable the Dataproc API, either through the Google Cloud console or from Cloud Shell.
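If you take the Cloud Shell route, the step comes down to a single `gcloud services enable` call. The following is a minimal Python sketch of that call, assuming the `gcloud` CLI is installed and already authenticated against the lab project. The helper names `enable_api_command` and `enable_api` are hypothetical and used only for illustration.

```python
import subprocess
from typing import List

# Service name of the Dataproc API.
DATAPROC_API = "dataproc.googleapis.com"


def enable_api_command(service: str = DATAPROC_API) -> List[str]:
    """Build the argument list for `gcloud services enable <service>`."""
    return ["gcloud", "services", "enable", service]


def enable_api(service: str = DATAPROC_API) -> None:
    """Run the command; raises CalledProcessError if gcloud exits non-zero."""
    subprocess.run(enable_api_command(service), check=True)
```

Calling `enable_api()` runs the command against whichever project is active in your `gcloud` configuration. Building the argument list separately makes it easy to print and inspect the command before running it.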
Create a Dataproc Cluster
Provision a Dataproc cluster capable of running the Spark job.
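As a rough illustration of this step, the sketch below builds a `gcloud dataproc clusters create` command in Python. The cluster name `spark-lab-cluster`, the region `us-central1`, and the worker count are assumptions chosen for the example, not values prescribed by the lab. The helper name `create_cluster_command` is likewise hypothetical.

```python
import subprocess
from typing import List


def create_cluster_command(
    name: str = "spark-lab-cluster",  # hypothetical cluster name
    region: str = "us-central1",      # assumed region; use the one the lab specifies
    num_workers: int = 2,             # assumed worker count
) -> List[str]:
    """Build the argument list for `gcloud dataproc clusters create`."""
    return [
        "gcloud", "dataproc", "clusters", "create", name,
        f"--region={region}",
        f"--num-workers={num_workers}",
    ]


def create_cluster(**kwargs) -> None:
    """Run the command; raises CalledProcessError if gcloud exits non-zero."""
    subprocess.run(create_cluster_command(**kwargs), check=True)
```

Machine types, disk sizes, and image versions are left at their defaults here. The lab environment may require specific values for those settings.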
Submit a Spark Job
Submit the Spark job, and evaluate the results.
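One way to submit the job is sketched below in Python, assuming the job is the `SparkPi` example that ships with Spark. The cluster name, the region, the jar path, and the task count of 1000 are all assumptions made for illustration, and `submit_pi_job_command` is a hypothetical helper name. The jar path shown is where Dataproc images are assumed to install the Spark examples.

```python
import subprocess
from typing import List

# Assumed location of the Spark examples jar on the cluster nodes.
EXAMPLES_JAR = "file:///usr/lib/spark/examples/jars/spark-examples.jar"


def submit_pi_job_command(
    cluster: str = "spark-lab-cluster",  # hypothetical cluster name
    region: str = "us-central1",         # assumed region
    tasks: int = 1000,                   # argument passed through to SparkPi
) -> List[str]:
    """Build the argument list for `gcloud dataproc jobs submit spark`."""
    return [
        "gcloud", "dataproc", "jobs", "submit", "spark",
        f"--cluster={cluster}",
        f"--region={region}",
        "--class=org.apache.spark.examples.SparkPi",
        f"--jars={EXAMPLES_JAR}",
        "--",          # everything after this is passed to the job itself
        str(tasks),
    ]


def submit_pi_job(**kwargs) -> None:
    """Run the command; the job's driver output is streamed to the terminal."""
    subprocess.run(submit_pi_job_command(**kwargs), check=True)
```

To evaluate the results, look in the driver output for the line in which `SparkPi` reports its estimate of Pi. A larger task count means more samples, so the job runs longer.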
What's a lab?
Hands-on Labs are real environments created by industry experts to help you learn. They let you gain knowledge and experience, practice without compromising your own system, test without risk, destroy without fear, and learn from your mistakes. Use Hands-on Labs to practice your skills before delivering in the real world.