User Guide

Back Next

Tokenizing Batch Data

The following image shows how you can tokenize data in HDFS and persist the tokenized data in HDFS or in a repository:

To tokenize the input data, perform the following tasks:

Run the

Relate 360

batch jobs to read the input data in HDFS, add tokens to the input data, and persist the tokenized data in HDFS or in a repository.

To search the tokenized data for matching records, use the RESTful web services, and the batch job.

Watch

Comments

0 COMMENTS

We’d like to hear from you! Log in to comment.