Data Services All Products
When a Data Processor transformation reads Parquet input with Amazon EMR in the Hive environment, the read action might fail.
Workaround: Remove the following compression codecs from the
core-site.xmlproperty on all EMR instances in the clusters:
Add the following values:
In addition, remove the
When using UTF-8 encoding and non-English characters in field names while importing a JSON schema, some fields with non-English characters might not be mapped.
Workaround: Provide a schema and data with UTF-8 encoding.
A hierarchical to relational Data Processor transformation is set to accumulation mode with the following
Binary output port collection
> Collect input rows to a single output set. For a JSON to hierarchical transformation, if you change the output port to binary, the output will be multiple JSON data sets, with each set on a separate line.
The Data Processor transformation can show 100,002 markings by default in the Data Viewer.
Workaround: To show all markings, create the environment variable
IFCM_MAX_MARKINGSand set it equal to
Service parameters are not supported for a Data Processor transformation with relational input.
Workaround: Create two Data Processor transformations. Configure the first Data Processor transformation to use Input Mapping mode without the service. Configure the second Data Processor transformation with the service and define the service parameters. Link the output of the first transformation to the second transformation. Pass the service parameters with passthrough ports through the first transformation.