Table of Contents

Search

  1. Preface
  2. Introduction to Informatica MDM - Relate 360
  3. Linking Batch Data
  4. Tokenizing Batch Data
  5. Processing Streaming Data
  6. Creating Relationship Graph
  7. Loading Linked and Consolidated Data into Hive
  8. Searching Data
  9. Monitoring the Batch Jobs
  10. Troubleshooting
  11. Glossary

User Guide

User Guide

Creating Relationship Graph

Creating Relationship Graph

A business entity type is a set of similar type of input data. The input data can be customer data, transaction data, product data, and other types of data. You can customize the
PZMAP
section of the configuration file and the match rule sets in the matching rules file for each type of input data. You can link or tokenize the input data based on the type of data.
For example, you can link the customer data to identify the household relationships. You do not require to link the transaction data. In this case, link the customer data and load the linked data into the repository, and tokenize the transaction data and load the tokenized data into the repository.
The following image shows the batch jobs that you can run to create the relationship graph:
If you want to link the input data, run the initial clustering, load clustering, and load match pairs jobs. If you do not want to link the input data, run the repository tokenization job. Similarly, process other types of data and load the processed data into the repository. Run the create relationship job to create relationship between two business entity types. Similarly, run the create relationship job and create relationships between other business entity types.
To create a relationship graph, perform the following tasks:
  1. If you want to link the input data, perform the following tasks:
    1. Run the initial clustering job.
      The job links the input data and creates linked and match-pair data in HDFS.
    2. Run the load clustering job.
      The job creates required tables in the repository and loads the linked data into the tables.
    3. Run the load match pairs job.
      The job loads the match-pair data of the business entity type into the repository.
  2. If you do not want to link the input data, run the repository tokenization job.
    The job tokenizes the input data, creates required tables in the repository, and loads the tokenized data into the tables.
  3. Similarly, process other business entity types and load the processed data into the repository.
  4. Run the create relationship job.
    The job creates relationship between two entities.
  5. Similarly, run the create relationship job for other entities to create the relationship graph.

0 COMMENTS

We’d like to hear from you!