Data Catalog: Qwik Start

30 minutes, 1 credit

GSP729

Google Cloud Self-Paced Labs

Overview

Data Catalog is a fully managed and scalable metadata management service that empowers organizations to quickly discover, understand, and manage all their data.

It offers a simple and easy-to-use search interface for data discovery, a flexible and powerful cataloging system for capturing both technical and business metadata, and a strong security and compliance foundation with Cloud Data Loss Prevention (DLP) and Cloud Identity and Access Management (IAM) integrations.

Google BigQuery is an enterprise data warehouse that enables super-fast SQL queries using the processing power of Google's infrastructure.

Simply move your data into BigQuery and let us handle the hard work. You can control access to both the project and your data based on your business needs, such as giving others the ability to view or query your data.

Using Data Catalog

There are two main ways you interact with Data Catalog:

  • Searching for data assets that you have access to.

  • Tagging assets with metadata.

Data Catalog use case

Imagine you are a data engineer at your company. Your job is to ensure that colleagues, such as data scientists or business analysts, can easily discover and use every dataset. When a new dataset comes in, you annotate it with important information, such as whether it contains personally identifiable information (PII), who owns the dataset, and how many rows it contains.

You record this information by adding tags to your datasets and tables. Data Catalog lets you create tag templates that define which attributes a tag can hold. This makes it easy to access, map, and discover pertinent information about your datasets and tables.
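The relationship between a tag template and a tag can be sketched in a few lines of plain Python. This is only a conceptual model, not the Data Catalog client library: the class names `TagTemplate` and `Tag`, the template ID `dataset_info`, the table name, and the field names `has_pii`, `owner`, and `row_count` are all hypothetical, chosen to mirror the attributes mentioned in the use case above.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


# Hypothetical in-memory model for illustration only.
# It does not call Data Catalog and is not its API.
@dataclass
class TagTemplate:
    """Declares which attributes a tag may hold and the type of each."""
    template_id: str
    fields: Dict[str, type]


@dataclass
class Tag:
    """Holds the attribute values attached to one table."""
    template: TagTemplate
    table: str
    values: Dict[str, Any] = field(default_factory=dict)

    def validate(self) -> None:
        """Raise if a value is not declared in the template or has the wrong type."""
        for name, value in self.values.items():
            expected = self.template.fields.get(name)
            if expected is None:
                raise KeyError(f"{name!r} is not defined in template "
                               f"{self.template.template_id!r}")
            # Exact type match, so that a bool is not accepted as an int.
            if type(value) is not expected:
                raise TypeError(f"{name!r} expects {expected.__name__}, "
                                f"got {type(value).__name__}")


# The template defines the attributes once (assumed names).
template = TagTemplate(
    template_id="dataset_info",
    fields={"has_pii": bool, "owner": str, "row_count": int},
)

# Each tag fills in those attributes for a specific table.
tag = Tag(
    template=template,
    table="my_dataset.taxi_trips",
    values={"has_pii": False, "owner": "data-eng@example.com", "row_count": 1000},
)
tag.validate()  # passes silently when every value matches the template
```

The point of the split is that the template is defined once and reused, so every tagged table describes itself with the same attribute names and types, which is what makes tagged assets comparable and searchable.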

What you will learn

In this lab, you will learn how to:

  • Enable the Data Catalog API so that you can use this service in your Google Cloud project.

  • Create a dataset with BigQuery.

  • Copy a public New York Taxi table to your dataset.

  • Create a Data Catalog tag template.

  • Attach a tag based on the new template to your newly created table.

Prerequisites

Very Important: Before starting this lab, log out of your personal or corporate Gmail account, or run this lab in an Incognito window. This prevents sign-in confusion while the lab is running.
