CloudBrain Training Logo

Google Cloud Big Data and Machine Learning Fundamentals

(1 day)

This course introduces the Google Cloud big data and machine learning products and services that support the data-to-AI lifecycle. It explores the processes, challenges, and benefits of building a big data pipeline and machine learning models with Vertex AI on Google Cloud.

Course Objectives

  • Recognize the data-to-AI lifecycle on Google Cloud and the major products of big data and machine learning.
  • Design streaming pipelines with Dataflow and Pub/Sub.
  • Analyze big data at scale with BigQuery.
  • Identify different options to build machine learning solutions on Google Cloud.
  • Describe a machine learning workflow and the key steps with Vertex AI.
  • Build a machine learning pipeline using AutoML.

Audience

  • Data analysts, data scientists, and business analysts who are getting started with Google Cloud
  • Individuals responsible for designing pipelines and architectures for data processing, creating and maintaining machine learning and statistical models, querying datasets, visualizing query results, and creating reports
  • Executives and IT decision makers evaluating Google Cloud for use by data scientists

Prerequisites

Basic understanding of one or more of the following:

  • Database query language such as SQL
  • Data engineering workflow from extract, transform, load, to analysis, modeling, and deployment
  • Machine learning models such as supervised versus unsupervised models

Course Outline

The course includes presentations, demonstrations, and hands-on labs.

Module 1: Big Data and Machine Learning on Google Cloud

  • Identify the different aspects of Google Cloud’s infrastructure
  • Identify the big data and machine learning products on Google Cloud

Module 2: Data Engineering for Streaming Data

  • Describe an end-to-end streaming data workflow from ingestion to data visualization
  • Identify modern data pipeline challenges and how to solve them at scale with Dataflow
  • Build collaborative real-time dashboards with data visualization tools

Module 3: Big Data with BigQuery

  • Describe the essentials of BigQuery as a data warehouse
  • Explain how BigQuery processes queries and stores data
  • Define BigQuery ML project phases
  • Build a custom machine learning model with BigQuery ML

Module 4: Machine Learning Options on Google Cloud

  • Identify different options to build ML models on Google Cloud
  • Define Vertex AI and its major features and benefits
  • Describe AI solutions in both horizontal and vertical markets

Module 5: The Machine Learning Workflow with Vertex AI

  • Describe a ML workflow and the key steps
  • Identify the tools and products to support each stage
  • Build an end-to-end ML workflow using AutoML

Module 6: Summary

  • Recap of key learning points
  • Resources