Course Overview
Gain the skills and knowledge necessary to implement and automate a data quality assurance process using the Informatica Data Quality platform, with profile and mapping execution on a Hadoop Spark environment. In addition to learning how to cleanse, standardize, and enhance data, students will learn to test and troubleshoot their Data Quality solutions. This course is applicable to version 10.5.1.
Objectives
After successfully completing this course, students should be able to:
- Describe the overall Data Quality Management Process
- Illustrate and discuss Data Engineering Quality Architecture and Runtime Environments
- Describe how to run mappings on the Spark and Blaze Engines
- Differentiate between the Analyst and Developer Roles and Tools
- Navigate the Developer tool and collaborate on projects with team members
- Perform Column, Rule, Multi object, Comparative, and Mid-Stream Profiling
- Manage Reference Tables
- Develop standardization, cleansing, and parsing Mappings and Mapplets
- Identify duplicate records using Classic Data Matching
- Create and execute Workflows to populate user inboxes with Exception and Duplicate record tasks
- Describe the deployment options that are available when executing Mappings outside of Informatica Developer
- Troubleshoot issues that may appear during development
Target Audience
- Developer
- Data Analyst
- Data Scientist
- Data Steward
Prerequisites
- None