Table of Contents

Search

  1. Preface
  2. Introduction to Informatica MDM - Relate 360
  3. Linking Batch Data
  4. Tokenizing Batch Data
  5. Processing Streaming Data
  6. Creating Relationship Graph
  7. Loading Linked and Consolidated Data into Hive
  8. Searching Data
  9. Monitoring the Batch Jobs
  10. Troubleshooting
  11. Glossary

User Guide

User Guide

Loading Consolidated Data from the Repository

Loading Consolidated Data from the Repository

Use the Hive enabler job to link a Hive table to the preferred records table in the repository. If you link a Hive table to the preferred records table in the repository, the data continues to persist in the repository, and the Hive table accesses data from the repository. You do not have to run the Hive enabler job again for the incremental data.
The following image shows how the Hive enabler job links the Hive table to the repository table:
The Hive enabler job links the Hive table with the preferred records table.
In the configuration file, if you set
StoreAllFields
to false, the repository does not persist all the columns but persists only the columns that you use to index the input data. If you want to view all the columns in Hive, ensure that you set
StoreAllFields
to true in the configuration file when you link the input data.

0 COMMENTS

We’d like to hear from you!