Configure the Complex File Reader For Parquet Input
Configure the Complex File Reader For Parquet Input
After you create a Data Processor transformation that converts Parquet input, add the transformation to a mapping with a complex file reader. Configure the complex file reader to process Parquet input.
In the Mapping editor, create a complex file reader object.
To configure the complex file reader, perform the following steps:
In the
Advanced
tab of the
Properties
view, select the
File Format
property, and then choose
Input Format
.
In the
Advanced
tab, select the
Input Format
property, and then type
com.informatica.parquet.ParquetToXML
.
To optimize performance, use the
Input Format Parameters
property to tune the
MaxOutputAccumulation
parameter. By default, the
MaxOutputAccumulation
parameter, which defines the expected number of output records, is set to 50,000. To change the setting to 250,000 for example, enter
"MaxOutputAccumulation"="250000"
.
By default, the complex file reader adds the schema to the complex file reader output within a single element directly after the root element. If you do not want to add the schema to the output, select the
Input Format Parameters
property, then type
"InjectSchema"="false"
.
Use a semi-colon to separate multiple parameters, for example
Add the Data Processor transformation to the mapping. The transformation input port should remain set to binary input, the default setting for Parquet input.
Link the complex file reader output port to the Data Processor transformation input port. The complex file reader output port should remain set to binary output.