Parallel Sorting


To improve mapping performance, the Data Integration Service enables parallel sorting by default in a mapping that has a Sorter transformation and a flat file target.
In a Hadoop environment, the Data Integration Service enables parallel sorting for a mapping that meets the following rules and guidelines:
  • The mapping does not include another transformation between the Sorter transformation and the target.
  • The data type of each sort key does not change between the Sorter transformation and the target.
  • Each sort key in the Sorter transformation is linked to a column in the target.
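
The rules above can be read as a checklist to run against a mapping design. The following Python sketch is illustrative only: MappingSketch, Port, and parallel_sort_blockers are hypothetical names invented for this example and are not part of any Informatica API. It assumes a simplified model in which each sort key and target column has a name and a data type, and links are recorded as a dictionary from sort key name to target column name.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Port:
    """A named port or column with a data type (hypothetical model)."""
    name: str
    data_type: str


@dataclass
class MappingSketch:
    """Simplified, hypothetical view of a mapping; not an Informatica API."""
    sort_keys: List[Port]
    target_columns: List[Port]
    # Maps a sort key name to the name of the target column it is linked to.
    links: Dict[str, str]
    # Names of any transformations placed between the Sorter and the target.
    between_sorter_and_target: List[str] = field(default_factory=list)


def parallel_sort_blockers(mapping: MappingSketch) -> List[str]:
    """Return the guidelines the mapping does not meet (empty if all are met)."""
    reasons = []

    # Guideline 1: no other transformation between the Sorter and the target.
    if mapping.between_sorter_and_target:
        reasons.append("transformation between the Sorter and the target")

    targets = {column.name: column for column in mapping.target_columns}
    for key in mapping.sort_keys:
        linked = targets.get(mapping.links.get(key.name))
        if linked is None:
            # Guideline 3: each sort key is linked to a target column.
            reasons.append(f"sort key {key.name} is not linked to a target column")
        elif linked.data_type != key.data_type:
            # Guideline 2: the sort key data type does not change.
            reasons.append(f"data type of sort key {key.name} changes at the target")

    return reasons
```

For example, a mapping with one integer sort key linked to an integer target column and nothing between the Sorter and the target returns an empty list. Adding an Expression transformation in between, or linking the key to a string column, returns one reason for each guideline that is not met.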


Updated November 09, 2018