You can read data from and write data to HDFS through a fixed-width or delimited flat file data object that does not contain binary data.
You can create or import a flat file data object. The data object properties that you specify in the Developer tool must match the properties of the source file.
After you create a flat file data object, you can edit the following file properties:
HDFS connection properties
Compression formats
To read large volumes of data, you can connect a flat file source to read data from a directory of flat files.
You can use the flat file data objects as a source, target, or lookup transformation in mappings and mapplets. You can select the mapping environment and run the mappings in a native or Hadoop run-time environment. You can create and run profiles against flat file data objects.
When you configure a mapping that contains flat file data objects to run in the native environment, you can enable the mapping for partitioning. The Data Integration Service can use multiple partitions to read data from flat file sources with an HDFS connection. The Data Integration Service can also use multiple partitions to write data to flat file targets with an HDFS connection. When the Data Integration Service adds partitions, it increases the number of processing threads, which can increase mapping performance.