Table of Contents

Search

  1. Preface
  2. Introduction to Informatica MDM - Relate 360
  3. Linking Batch Data
  4. Tokenizing Batch Data
  5. Processing Streaming Data
  6. Creating Relationship Graph
  7. Loading Linked and Consolidated Data into Hive
  8. Searching Data
  9. Monitoring the Batch Jobs
  10. Troubleshooting
  11. Glossary

User Guide

User Guide

Tokenization Process

Tokenization Process

You can tokenize batch data or streaming data. To tokenize batch data, store the input data in HDFS.
Relate 360
reads the input data from HDFS and tokenizes the input data. You can then persist the tokenized data in a repository. To tokenize streaming data, stream the input data in the JSON format.
Relate 360
tokenizes the input data and persists the tokenized data in a repository.
The tokenized data contains input data and encoded tokens for the input data. You can perform searches on the tokenized data.

0 COMMENTS

We’d like to hear from you!