Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Data Engineering Integration
  3. Mappings
  4. Mapping Optimization
  5. Sources
  6. Targets
  7. Transformations
  8. Python Transformation
  9. Data Preview
  10. Cluster Workflows
  11. Profiles
  12. Monitoring
  13. Hierarchical Data Processing
  14. Hierarchical Data Processing Configuration
  15. Hierarchical Data Processing with Schema Changes
  16. Intelligent Structure Models
  17. Blockchain
  18. Stateful Computing
  19. Appendix A: Connections Reference
  20. Appendix B: Data Type Reference
  21. Appendix C: Function Reference

Prepare Mappings for the Cluster Workflow

Prepare Mappings for the Cluster Workflow

Prepare mappings for inclusion in the cluster workflow by setting mapping and Mapping task properties to specify where mappings run.
You can create a cluster workflow to run some mappings on the cluster that the workflow creates, and other mappings on another cluster.
Run the mapping on the cluster that the workflow creates.
The following table describes the run-time behavior based on how you configure the mapping and Mapping task:
Mapping
Hadoop Connection Property
Mapping Task
Cluster Identifier Property
Run Time Behavior
Auto Deploy
Auto Deploy
The Data Integration Service generates temporary Hadoop connections based on the values in the Hadoop connection associated with the workflow, and uses the temporary connections to run mappings on the cluster.
<Hadoop connection name>
Auto Deploy
The Mapping task Cluster Identifier property overrides the mapping Hadoop connection property.
You might want to do this if you want to maintain the mapping Hadoop connection property value.
Run the mapping on another cluster.
The following table describes the run-time behavior based on how you configure the mapping and Mapping task:
Mapping
Hadoop Connection Property
Mapping Task
Cluster Identifier Property
Run Time Behavior
<Hadoop connection name>
Blank
The Mapping task Cluster Identifier property takes input from the Hadoop connection and runs the mapping on the cluster identified in the cloud configuration property of the Hadoop connection.