Data Science on the Google Cloud Platform

Advanced 10 Steps 1 day 60 크레딧

This is the first of two Quests of hands-on labs is derived from the exercises from the book Data Science on Google Cloud Platform by Valliappa Lakshmanan, published by O'Reilly Media, Inc. In this first Quest, covering up through chapter 8, you are given the opportunity to practice all aspects of ingestion, preparation, processing, querying, exploring and visualizing data sets using Google Cloud Platform tools and services.

Data Machine Learning


This Quest assumes you have access to the O’Reilly book Data Science on the Google Cloud Platform as the labs only include the exercises from the end of each chapter and do not contain the concepts or teaching from the text itself. The labs use GCP Services and Tools for data storage, transformation and warehousing, so it is recommended that the student also has earned Badges for the Baseline: Data, ML, and AI and the GCP Essentials Quests before beginning.

Quest Outline


BigQuery 및 Cloud SQL용 SQL 소개

이 실습에서는 기본적인 SQL 절을 학습하고 BigQuery 및 Cloud SQL에서 구조화된 쿼리를 실행하는 연습을 진행합니다.

Deutsch English español (Latinoamérica) français bahasa Indonesia 日本語 한국어 português (Brasil)

Ingesting Data Into The Cloud

In this lab you'll learn how to use a bash script to download selected data from a large public data set that is available on the internet.


warning Ingesting Data Into The Cloud Using Google App Engine

This lab demonstrates how to use local Python scripts to retrieve data from the US Bureau of Transport Statistics website, then modify the data so they can be run using Google App Engine.


Loading Data into Google Cloud SQL

In this lab you will import data from CSV text files into Cloud SQL and then carry out some basic data analysis using simple queries.


Visualizing Data with Google Data Studio

This lab demonstrates how to use Google Data Studio to visualize data stored in Google Cloud SQL.


Processing Data with Google Cloud Dataflow

In this lab you will simulate a real-time real world data set from a historical data set. This simulated data set will be processed from a set of text files using Python and Google Cloud DataFlow, and the resulting simulated real-time data will be stored in Google BigQuery.


Visualize Real Time Geospatial Data with Google Data Studio

Use Google Dataflow to process real-time streaming data from a real-time real world historical data set, storing the results in Google BigQuery and then using Google Data Studio to visualize real-time geospatial data.


Loading Data into Google BigQuery for Exploratory Data Analysis

You will learn how to load text data into Google BigQuery and then use that data for rapid exploratory data analysis using Google Cloud Datalab notebooks.


Exploratory Data Analysis Using AI Platform

Learn the process of analyzing a data set stored in BigQuery using AI Platform to perform queries and present the data using various statistical plotting techniques.


Evaluating a Data Model

Learn the process for partitioning a data set into a training set that will be used to develop a model, and a test set that can then be used to evaluate the accuracy of the model and then independently evaluate predictive models in a repeatable manner.

지금 등록

배지 획득에 대한 진행 상황을 추적하려면 이 퀘스트에 등록하세요.