Data Engineering on Google Cloud Platform Training Logo

Data Engineering on Google Cloud Platform Training

Live Online & Classroom Enterprise Certification Training

Powered By

Google Cloud Platform Logo

This four-day instructor-led GCP Data Engineering training class provides participants a hands-on introduction to designing and building data processing systems on Google Cloud Platform.

ATP_Authorized Logo

Powered By

Google Cloud Platform Logo

Looking for a private batch ?

REQUEST A CALLBACK

Need help finding the right training?

Your Message

  • Certified Trainer

  • Authorized Courseware

  • Completion Certificate from ATP

  • Enterprise Reporting

  • Lifetime Access

  • CloudLabs

  • 24x7 Support

  • Real-time code analysis and feedback

What is GCP Data Engineer Certification Training about?

GCP Data Engineering Certification training offers a combination of presentations, demos, and hand-on labs, through which participants will learn how to design data processing systems, build end-to-end data pipelines, analyze data and carry out machine learning. This GCP Data Engineering course covers structured, unstructured, and streaming data.

Who is GCP Data Engineer Certification Training for?

This GCP Data Engineering Certification training class is intended for


  • Experienced developers who are responsible for managing big data transformations including: Extracting, Loading, Transforming, cleaning, and validating data
  • Designing pipelines and architectures for data processing
  • Creating and maintaining machine learning and statistical models Querying datasets, visualizing query results and creating reports 

What are the prerequisites for GCP Data Engineer Certification Training?

To get the most of out of this course, participants should have:


  • Completed Google Cloud Fundamentals: Big Data & Machine Learning course OR
  • Have equivalent experience Basic proficiency with common query language such as SQL Experience with data modeling, extract, transform, load activities
  • Developing applications using a common programming language such as Python Familiarity with Machine Learning and/or statistics

Available Training Modes

Live Online Training

24 Hours

Classroom Training

4 Days

Course Outline Expand All

Expand All

  • Creating and managing clusters.
  • Leveraging custom machine types and preemptible worker nodes.
  • Scaling and deleting Clusters.
  • Lab: Creating Hadoop Clusters with Google Cloud Dataproc.
  • Running Pig and Hive jobs.
  • Separation of storage and compute.
  • Lab: Running Hadoop and Spark Jobs with Dataproc.
  • Lab: Submit and monitor jobs.
  • Customize cluster with initialization actions.
  • BigQuery Support.
  • Lab: Leveraging Google Cloud Platform Services.
  • Google’s Machine Learning APIs.
  • Common ML Use Cases.
  • Invoking ML APIs.
  • Lab: Adding Machine Learning Capabilities to Big Data Analysis.
  • What is BigQuery.
  • Queries and Functions.
  • Lab: Writing queries in BigQuery.
  • Loading data into BigQuery.
  • Exporting data from BigQuery.
  • Lab: Loading and exporting data.
  • Nested and repeated fields.
  • Querying multiple tables.
  • Lab: Complex queries.
  • Performance and pricing.
  • The Beam programming model.
  • Data pipelines in Beam Python.
  • Data pipelines in Beam Java.
  • Lab: Writing a Dataflow pipeline.
  • Scalable Big Data processing using Beam.
  • Lab: MapReduce in Dataflow.
  • Incorporating additional data.
  • Lab: Side inputs.
  • Handling stream data.
  • GCP Reference architecture.
  • What is machine learning (ML).
  • Effective ML: concepts, types.
  • ML datasets: generalization.
  • Lab: Explore and create ML datasets.
  • Getting started with TensorFlow.
  • Lab: Using tf.learn.
  • TensorFlow graphs and loops + lab.
  • Lab: Using low-level TensorFlow + early stopping.
  • Monitoring ML training.
  • Lab: Charts and graphs of TensorFlow training.
  • Why Cloud ML?
  • Packaging up a TensorFlow model.
  • End-to-end training.
  • Lab: Run a ML model locally and on cloud.
  • Creating good features.
  • Transforming inputs.
  • Synthetic features.
  • Preprocessing with Cloud ML.
  • Lab: Feature engineering.
  • Stream data processing: Challenges.
  • Handling variable data volumes.
  • Dealing with unordered/late data.
  • Lab: Designing streaming pipeline.
  • What is Cloud Pub/Sub?
  • How it works: Topics and Subscriptions.
  • Lab: Simulator.
  • Challenges in stream processing.
  • Handle late data: watermarks, triggers, accumulation.
  • Lab: Stream data processing pipeline for live traffic data.
  • Streaming analytics: from data to decisions.
  • Querying streaming data with BigQuery.
  • What is Google Data Studio?
  • Lab: build a real-time dashboard to visualize processed data.
  • What is Cloud Spanner?
  • Designing Bigtable schema.
  • Ingesting into Bigtable.
  • Lab: streaming into Bigtable.

Who is the instructor for this training?

The trainer for this Google Cloud Platform training has extensive experience in this domain, including years of experience training & mentoring professionals.

Course Logo

GCP Data Engineer Certification Training - Certification & Exam

  • SpringPeople is the Authorized Training Partner of Google Cloud Platform
  • Data Engineering on Google Cloud Platform training prepares you for  Professional Data Engineer Certification Exam
  • The training fees is exclusive of exam cost.
  • For any queries; feel free to reach us at gcp@springpeople.com

Reviews