Building Batch Data Pipelines on Google Cloud

Seminar / Firmentraining

Zielgruppe

This course is intended for developers who are responsible for designing pipelines and architectures for data processing.

Voraussetzungen

  • Experience with data modeling and ETL (extract, transform, load) activities.
  • Experience with developing applications by using a common programming language such as Python or Java.

Inhalte

  • Review different methods of data loading: EL, ELT and ETL and when to use what.
  • Run Hadoop on Dataproc, use Cloud Storage, and optimize Dataproc jobs.
  • Build your data processing pipelines by using Dataflow.
  • Manage data pipelines with Data Fusion and Cloud Composer