Processing Unstructured and Semi-structured Data with Intelligent Structure Model Overview
Processing Unstructured and Semi-structured Data with
Intelligent Structure Model
Overview
You can use CLAIRE™
Intelligent Structure Discovery
to parse semi-structured or structured data in mappings that run on the Spark engine.
Long, complex files with little or no structure can be difficult to understand much less parse. CLAIRE™
Intelligent Structure Discovery
can automatically discover the structure in unstructured data.
CLAIRE™ uses machine learning algorithms to decipher data in semi-structured or unstructured data files and create a model of the underlying structure of the data. You can generate an
Intelligent structure model
, a model of the pattern, repetitions, relationships, and types of fields of data discovered in a file, in
Informatica Intelligent Cloud Services
.
To use the model, you export it from
Data Integration
, and then can associate it with a data object in a Big Data Management mapping. You can run the mapping on the Spark engine to process the data. The mapping uses the
Intelligent structure model
to extract and parse data from input files based on the structure expressed in the model.