Enterprise Data Catalog: Configuration and Maintenance

Instructor Led | Big Data | 3 Days | Version 10.2.1

Course Overview

This course applies to software version 10.2.1. Gain the skills and knowledge needed to install, configure, and maintain an Enterprise Data Catalog (EDC) environment. Using the Catalog Administrator, learn to manage and monitor resources, schedules, attributes, and connections for both initial implementation and ongoing system maintenance.

Objectives

After successfully completing this course, students should be able to:

  • Install and configure EDC in line with the sizing requirements
  • Use the Catalog Administrator interface
  • Scan resources to obtain datasets
  • Manage Resources, Schedules, Attributes, Synonyms and Connections
  • Configure reusable settings
  • Manage Data Domains and Composite Data Domains
  • Extract metadata from data sources using the Universal Connectivity Framework
  • Create Custom models and Custom resource types
  • Monitor and Troubleshoot EDC
  • Use REST APIs

Target Audience

  • Administrator
  • Architect
  • Developer

Prerequisites

Agenda

Module 1: Overview of Enterprise Data Catalog

  • Major Business Challenges
  • EDC as a Solution
  • Key capabilities of EDC
  • Metadata and Metadata Management
  • EDC architecture
  • EDC features
  • EDC concepts
  • Catalog administration tasks
  • Catalog Administrator workspaces

Module 2: EDC Pre-Installation

  • Installation overview
  • Installation Phases
  • Perform Pre-installation steps
  • Deployment Methods 

Module 3: Installation

  • Pre-installation checklist
  • Installation Files
  • Installation modes
  • EDC Installation
  • Installation in Silent Mode
  • Post-installation phases
  • Uninstallation steps

Module 4: Resource Creation and Security

  • Overview of resources and scanners
  • Creation of Users
  • Create and scan resources:
    • Oracle
    • PowerCenter
    • Hive
    • Business Glossary
    • Informatica Platform
  • Supported File System and File Formats
  • Resource Security
  • Lab: Creating Users in Informatica Administrator
  • Lab: Creating New Oracle Resources
  • Lab: Creating a New PowerCenter Resource
  • Lab: Creating Oracle Resources from a Different Schema
  • Lab: Creating a Hive Resource
  • Lab: Creating a New Business Glossary Resource
  • Lab: Creating a BDM Resource Type
  • Lab: Creating an Avro Resource
  • Lab: Configuring Permissions for Resources

Module 5: Resource Management

  • Connections Management
  • Connection types
  • Profile Configuration Management
  • Data Similarity
  • Reusable Data Integration Service (DIS) configuration
  • Schedule Management
  • Lab: Managing PowerCenter Connections
  • Lab: Managing BDM Connections
  • Lab: Profiling and Data Discovery
  • Lab: Setting up a Reusable Data Integration Service Configuration
  • Lab: Creating a Schedule

Module 6: Data Domains

  • Data Domain Discovery
  • Data Domain Discovery Types
  • Supported Resource Types for Data Discovery
  • Data Domains and Data Domain groups
  • Data Domain Curation
  • Data Domain Inference
  • Data Domain Propagation
  • Composite Data Domains
  • Smart Domains
  • Lab: Creating Rule-Based Data Domains
  • Lab: Creating a Data Domain Group
  • Lab: Creating Smart Domains
  • Lab: Creating Composite Data Domains
  • Lab: Curating a Data Domain

Module 7: Attribute Management and Synonyms

  • System and Custom attributes
  • Attribute properties
  • Edit system attributes
  • Create and use custom attributes
  • Synonym definition files
  • Upload the synonym definition file in Catalog Administrator
  • Lab: Editing System Attributes
  • Lab: Creating a Custom Attribute
  • Lab: Loading Synonyms

Module 8: Universal Connectivity Framework

  • Metadata Models
  • Universal Connectivity Framework
  • Supported metadata sources
  • Creating resource types
  • Creating a resource for the defined resource type
  • Lab: Creating a Resource Based on a Universal Resource Type

Module 9: Custom Models and Resources

  • Types of metadata models
  • Custom scanner framework
  • Custom metadata integration
  • Create and manage a custom model
  • Create the custom resource type
  • Create the custom resource
  • Custom Scanners
  • Extracting metadata from custom scanners
  • Metadata Ingestion
  • Lab: Creating a Custom Scanner

Module 10: Performance Tuning

  • Performance tuning stages and parameters
  • EDC sizing recommendations
  • Tuning performance based on the size of the data
  • Tuning performance for similarity
  • Tuning profile warehouse
  • Data integration service system requirements for profiling
  • Tuning for profiling performance
  • Data integration service parameters
  • Profile configuration in data integration service
  • Data integration service profiling properties
  • Lab: Tuning Performance Based on the Size of the Data

Module 11: Monitoring and Troubleshooting Enterprise Data Catalog

  • Monitor resources and tasks
  • Manage tasks
  • Apply filters to monitor tasks
  • Troubleshoot errors in EDC
  • Lab: Monitoring Catalog Administrator

Module 12: REST APIs

  • REST API Overview
  • Concepts Exposed by REST API
  • HTTP Methods
  • Cataloging with REST APIs
  • Resource Execution API
  • Resource Information API
  • Model Modification API
  • Lab: Running and Monitoring a Resource Scan
  • Lab: Creating a Custom Attribute
 