You can tokenize batch data or streaming data. To tokenize batch data, store the input data in HDFS.
Relate 360
reads the input data from HDFS and tokenizes the input data. You can then persist the tokenized data in a repository. To tokenize streaming data, stream the input data in the JSON format.
Relate 360
tokenizes the input data and persists the tokenized data in a repository.
The tokenized data contains input data and encoded tokens for the input data. You can perform searches on the tokenized data.