Table of Contents

Search

  1. Preface
  2. Introduction to PowerExchange for HDFS
  3. PowerExchange for HDFS Configuration
  4. HDFS Connections
  5. HDFS Data Objects
  6. HDFS Data Extraction
  7. HDFS Data Load
  8. HDFS Mappings
  9. Appendix A: Data Type Reference

PowerExchange for HDFS User Guide

PowerExchange for HDFS User Guide

Complex Files Output Collection Mode

Complex Files Output Collection Mode

When you write data to complex files, you can choose to collect the input rows and write the output to a single file, or create an output row for each input row.
You can specify the output collection mode in the Data Processor transformation based on the complex file type.
To specify the output collection mode in the Data Processor transformation, open the Data Processor transformation and click the
Settings
view. In the
Binary output collection mode
section, specify the output collection mode.
The following table describes the options that you can select for the output collection mode:
Property Name
Property Description
Collect input rows to a single output
Select this option if you want to collect all input rows and write the output to a single file.
Split output when size exceeds
When you write the output to a single file, you can choose to split the output file when it exceeds a particular size.
Enter the size in MB exceeding which the file must be split.
Default is 100 MB.
Output row for each input row (do not collect)
Select this option if you want to write an output row for each input row.

Output Collection Mode for Binary Files

When you write to binary files in a native or Hadoop environment, you can specify the output collection mode in the Data Processor transformation.

Output Collection Mode for Sequence Files and Custom Output Format Files

When you write to sequence files or custom output format files in a native environment, PowerExchange for HDFS writes all the key-value pairs into one output file. The number of key-value pairs that PowerExchange for HDFS writes depends on the output collection mode that you specified in the Data Processor transformation.

0 COMMENTS

We’d like to hear from you!