Rules and Guidelines for Intelligent Structure Models
Consider the following rules and guidelines when you work with an intelligent structure model:
When you create an intelligent structure model in Cloud Data Integration, select the Data Integration version that corresponds to your current Data Engineering Integration version.
The model or sample file that you select for a data object should match the structure of the expected input files as closely as possible. If an input file does not match, or only partially matches, the model that you select or that Intelligent Structure Discovery creates based on the sample file, a large amount of data might be unidentified. Therefore, choose a sample file that represents the expected input files.
If you select a sample XML file that tags special characters that javax.xml.parsers can't parse, the data object fails to load the file. For example, javax.xml.parsers can't parse the Unicode character U+035D.
You cannot use an intelligent structure model in a Write transformation.
A data object with an intelligent structure model parses PDF forms, Microsoft Word tables, and Microsoft Excel files whose size is less than the supported Hadoop split size of 256 MB.
An intelligent structure model can parse the data within PDF form fields but not data outside the fields.
An intelligent structure model can parse data within Microsoft Word tables. Other data is unidentified.
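Two of the guidelines above can be checked before you select a sample file: whether javax.xml.parsers can parse an XML file, and whether a file is smaller than 256 MB. The following Java sketch shows one possible pre-check. It is not part of the product. The class name SampleFileCheck and its method names are hypothetical, and the sketch assumes that 256 MB means 256 × 1024 × 1024 bytes. It uses only standard JDK classes.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper for illustration only; not an Informatica API.
class SampleFileCheck {

    // Assumption: 256 MB is interpreted as 256 * 1024 * 1024 bytes.
    static final long MAX_BYTES = 256L * 1024 * 1024;

    // Returns true if the size is strictly less than the 256 MB limit.
    static boolean isWithinSizeLimit(long sizeInBytes) {
        return sizeInBytes >= 0 && sizeInBytes < MAX_BYTES;
    }

    // Returns true if javax.xml.parsers can parse the stream as XML.
    // Any parse, configuration, or read failure is reported as false.
    static boolean isParsableXml(InputStream in) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            DocumentBuilder builder = factory.newDocumentBuilder();
            // DefaultHandler rethrows fatal errors without printing to stderr.
            builder.setErrorHandler(new DefaultHandler());
            builder.parse(in);
            return true;
        } catch (SAXException | ParserConfigurationException | IOException e) {
            return false;
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java SampleFileCheck <file>");
            return;
        }
        Path file = Paths.get(args[0]);
        System.out.println("within size limit: " + isWithinSizeLimit(Files.size(file)));
        // Run the XML check only for files with an .xml extension.
        if (file.toString().toLowerCase().endsWith(".xml")) {
            try (InputStream in = Files.newInputStream(file)) {
                System.out.println("parsable XML: " + isParsableXml(in));
            }
        }
    }
}
```

A file that passes both checks can still produce unidentified data if its structure differs from the expected input files. The sketch covers only the parser and size guidelines.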