Enterprise Data Catalog: Configuration and Maintenance

onDemand | Big Data | Self-Paced | Version 10.4

Course Overview

This course applies to software version 10.4. Gain the skills and knowledge needed to install, configure, and maintain an Enterprise Data Catalog (EDC) environment. Using the Catalog Administrator, learn to manage and monitor resources, schedules, attributes, and connections for both initial implementation and ongoing system maintenance.

Objectives

After successfully completing this course, students should be able to:

  • Scan resources to obtain datasets
  • Enable profiling in resources
  • Manage resources, schedules, attributes, synonyms, and connections
  • Configure reusable settings
  • Manage data domains and composite data domains
  • Monitor and troubleshoot EDC
  • Create custom models and custom resource types

Target Audience

  • Administrator
  • Architect
  • Developer

Prerequisites

  • None

Agenda

Module 1: Enterprise Data Catalog Overview

  • Major Business Challenges
  • EDC as a Solution
  • Key capabilities of EDC
  • Metadata and Metadata Management
  • EDC architecture
  • EDC features
  • EDC concepts
  • Catalog administration tasks
  • Catalog Administrator workspaces

Module 2: EDC Pre-Installation

  • Installation overview
  • Installation Phases
  • Perform Pre-installation steps
  • Deployment Methods 

Module 3: EDC Installation

  • Pre-installation checklist
  • Installation Files
  • Installation modes
  • EDC Installation
  • Installation in Silent Mode
  • Post-installation phases
  • Uninstallation steps

Module 4: Resource Creation

  • Overview of resources and scanners
  • Creation of Users
  • Create and scan resources:
    • Oracle
    • PowerCenter
    • Hive
    • Business Glossary
    • Axon
    • Informatica Platform
  • Supported File System and File Formats
  • Lab: Creating Users in Informatica Administrator
  • Lab: Creating New Oracle Resources
  • Lab: Creating a New PowerCenter Resource
  • Lab: Creating Oracle Resources from a Different Schema
  • Lab: Creating a Hive Resource
  • Lab: Creating a New Business Glossary Resource
  • Lab: Creating an Axon Resource
  • Lab: Creating an Informatica Platform Resource
  • Lab: Creating an Avro Resource

Module 5: Security

  • Resource Security
  • User and Group Permissions
  • Lab: Configuring Permissions for Resources

Module 6: Resource Management

  • Connections Management
  • Connection types
  • Profile Configuration Management
  • Metadata and Data Profile Filters
  • Unique Key Inference
  • Business Term Propagation
  • Reusable Data Integration Service (DIS) configuration
  • Schedule Management
  • Lab: Managing PowerCenter Connections
  • Lab: Managing Informatica Platform Connections
  • Lab: Profiling and Data Discovery
  • Lab: Setting up a Reusable Data Integration Service Configuration
  • Lab: Creating a Schedule
  • Lab: Enabling Metadata Filter
  • Lab: Enabling Business Term Association
  • Lab: Enabling Reference Resources

Module 7: Data Domains

  • Data Domain Discovery
  • Data Domain Discovery Types
  • Supported Resource Types for Data Discovery
  • Data Domains and Data Domain Groups
  • Data Domain Curation
  • Data Domain Inference
  • Data Domain Propagation
  • Composite Data Domains
  • Smart Domains
  • Data Similarity
  • Lab: Creating Rule-Based Data Domains
  • Lab: Creating a Data Domain Group
  • Lab: Creating Smart Domains
  • Lab: Creating Composite Data Domains
  • Lab: Data Domain Curation

Module 8: Attribute Management and Synonyms

  • System and Custom attributes
  • Attribute properties
  • Edit system attributes
  • Create and use custom attributes
  • Synonym definition files
  • Upload the synonym definition file in Catalog Administrator
  • Lab: Creating a Custom Attribute
  • Lab: Loading Synonyms

Module 9: Custom Models and Resources

  • Types of metadata models
  • Custom scanner framework
  • Custom metadata integration
  • Create and manage custom models
  • Create a custom resource type
  • Create a custom resource
  • Custom Scanners
  • Extracting metadata from custom scanners
  • Metadata Ingestion
  • Lab: Creating a Custom Scanner

Module 10: Monitoring and Troubleshooting EDC

  • Monitor resources and tasks
  • Manage tasks
  • Apply filters to monitor tasks
  • Troubleshoot errors in EDC

Module 11: REST APIs

  • REST API Overview
  • Concepts Exposed by REST API
  • HTTP Methods
  • Cataloging with REST APIs
  • Lineage Filter APIs
  • Resource Execution API
  • Resource Information API
  • Model Modification API
  • Lab: Running and Monitoring a Resource Scan
  • Lab: Creating a Custom Attribute
  • Lab: Creating a New Search Tab
  • Lab: Listing Inferred Business Terms
  • Lab: Creating a Lineage Filter

Module 12: Performance Tuning

  • Performance tuning stages and parameters
  • EDC sizing recommendations
  • Tuning performance based on the size of the data
  • Tuning performance for similarity
  • Tuning profile warehouse
  • Data Integration Service system requirements for profiling
  • Tuning for profiling performance
  • Data Integration Service parameters
  • Profile configuration in the Data Integration Service
  • Data Integration Service profiling properties
  • Lab: Performance Tuning Using the infacmd autotune Command
