to parse semi-structured or structured data in mappings that run on the Spark engine.
Long, complex files with little or no structure can be difficult to understand much less parse. CLAIRE
Intelligent Structure Discovery
can automatically discover the structure in unstructured data.
CLAIRE uses machine learning algorithms to decipher data in semi-structured or unstructured data files and create a model of the underlying structure of the data. You can generate an
Intelligent structure model
, a model of the pattern, repetitions, relationships, and types of fields of data discovered in a file, in
Informatica Intelligent Cloud Services
.
To use the model, you export it from
Data Integration
, and then can associate it with a data object in a Big Data Management mapping. You can run the mapping on the Spark engine to process the data. The mapping uses the
Intelligent structure model
to extract and parse data from input files based on the structure expressed in the model.