Cloud Data Quality

onDemand | IDMC Data Quality | Self-Paced | Release 41

Course Overview

Learn the fundamentals of Informatica Intelligent Cloud Data Quality including the Cloud Architecture and GUI, Data Quality Assets/Transformations, and Cloud Mapping Designer. This course enables you to design and build your Data Quality Cloud Process for use in Data Migration, Data Integration, or Data Quality Projects. This course is applicable to Release 41.

Objectives

After successfully completing this course, students should be able to:

  • Describe Informatica Cloud Architecture
  • Download and install the Secure Agent
  • Describe what Cloud Data Quality is and how it can be used
  • Use Cloud Administrator to define Connections
  • Create Mappings using Cloud Mapping Designer
  • Profile data to identify anomalies
  • Create Dictionaries to hold reference data for verification and standardization routines
  • Use Rule Specifications to build rules to identify bad data
  • Describe the Scorecarding process
  • Identify and label data in fields using a Labeler Asset
  • Configure the Cleanse Asset to cleanse bad data identified during profiling
  • Configure the Parse Asset to parse data
  • Use the Deduplicate functionality to identify and consolidate duplicate records
  • Verify and enhance addresses using the Verifier Asset

Target Audience

  • Developer
  • Business User

Prerequisites

  • None

Agenda

Module 1: Informatica Cloud Services Overview
  • Introduction to Informatica Intelligent Cloud Services (IICS)
  • Informatica Cloud Terminology
  • Informatica Cloud Architecture
  • Informatica Cloud Services
  • Runtime Environments
  • Connections
  • The Administrator Service
  • Lab: Defining Connections
Module 2: Cloud Data Quality Overview
  • What is Data Quality?
  • Discuss the Data Quality Management Process Cycle
  • List and explain the Dimensions of Data Quality
  • Describe Data Quality functions, inputs, and outputs
  • Cloud Data Quality Services and Assets
Module 3: Cloud Mapping Designer
  • Cloud Mapping Designer Overview
  • Mapping Designer Terminology
  • Mappings and Mapplets
  • Common Transformations
  • Lab: Create your training folder
  • Lab: Create and run a mapping to load data into a SQL table
Module 4: Cloud Data Profiling
  • Profile Data
  • Review Profiling Results and identify anomalies
  • Profile Features
  • Lab: Profiling Data
  • Lab: Profiling Insights
Module 5: Dictionaries
  • What are dictionaries and why are they used?
  • Creating dictionaries
  • Lab: Create a dictionary to standardize data
  • Lab: Copy and edit an existing dictionary to validate data
  • Lab: Create a dictionary to enhance data
Module 6: Rule Specifications
  • Introduction to rule specifications
  • Building rule specifications
  • Lab: Create a rule specification to validate the company field
  • Lab: Create a rule specification with multiple rules
Module 7: Scorecards
  • Scorecard Overview
  • Update a profile and define rule occurrences
  • Review Scorecards
  • Lab: Apply rules to a profile and review
Module 8: The Labeler Asset
  • Standardization Overview
  • Introduction to the Labeler Asset
  • Configuring a Labeler Asset in Token Labeler mode
  • Configuring a Labeler Asset in Character Labeler mode
  • Lab: Create a Labeler to mask nonnumeric data
Module 9: The Cleanse Asset
  • Introduction to the Cleanse Asset
  • Cleanse, standardize, and enhance data
  • Build a mapping to cleanse and transform data
  • Lab: Create a mapplet to cleanse and standardize the Company name
  • Lab: Configure a mapplet to derive a Master Contact name
  • Lab: Configure a mapplet to remove noise from a numeric field
  • Lab: Configure a mapping to cleanse and standardize data
Module 10: The Parse Asset
  • Introduction to the Parse Asset
  • Parsing data
  • Lab: Configure the Parse Asset in prebuilt mode
  • Lab: Configure the Parse Asset using a regular expression
  • Lab: Update the Load Mapping to include both datasets
  • Lab: Reprofile and standardize the data
Module 11: The Deduplicate Asset
  • Introduction to the Deduplicate Asset
  • Matching Theory
  • Identify matching or related records
  • Configure the Deduplicate Asset to consolidate matched data
  • Lab: Configure a Deduplicate Asset to identify duplicate or related records
  • Lab: Create a mapping to identify duplicate records
  • Lab: Update the Deduplicate Asset to consolidate matched records
Module 12: The Verifier Asset
  • Introduction to the Verifier Asset
  • Verify Address Data
  • Lab: Configure the Verifier Asset to verify and correct US master records
 

