Big Data Management User Guide

Workflow Task Run-Time Behavior

Set mapping and Mapping task properties to specify where the workflow runs Mapping tasks.
You can create a cluster workflow to run some mappings on the cluster that the workflow creates, and other mappings on another cluster.
To specify where each mapping runs, configure the Hadoop Connection property in the mapping and the Cluster Identifier property in the Mapping task.
Run the mapping on the cluster that the workflow creates.
The following table describes the run-time behavior based on how you configure the mapping and Mapping task:

| Mapping Hadoop Connection Property | Mapping Task Cluster Identifier Property | Run-Time Behavior |
|---|---|---|
| Auto Deploy | Auto Deploy | The Data Integration Service generates temporary Hadoop connections based on the values in the Hadoop connection associated with the workflow, and uses the temporary connections to run mappings on the cluster. |
| <Hadoop connection name> | Auto Deploy | The Mapping task Cluster Identifier property overrides the mapping Hadoop connection property. You might want to do this if you want to maintain the mapping Hadoop connection property value. |

Run the mapping on another cluster.
The following table describes the run-time behavior based on how you configure the mapping and Mapping task:

| Mapping Hadoop Connection Property | Mapping Task Cluster Identifier Property | Run-Time Behavior |
|---|---|---|
| <Hadoop connection name> | Blank | The Mapping task Cluster Identifier property takes input from the Hadoop connection and runs the mapping on the cluster identified in the cloud configuration property of the Hadoop connection. |
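
The three documented combinations can be read as a small decision table. The following Python sketch is illustrative only and is not part of any Informatica API: the function name `resolve_run_target`, the constant `AUTO_DEPLOY`, and the returned description strings are all hypothetical. It encodes only the combinations that this topic describes and raises an error for any other combination, because the topic does not state what happens in those cases.

```python
AUTO_DEPLOY = "Auto Deploy"  # property value as it appears in this topic


def resolve_run_target(mapping_hadoop_connection: str, task_cluster_identifier: str) -> str:
    """Describe where a mapping runs for the documented property combinations.

    Hypothetical helper for illustration; it does not call any Informatica service.
    """
    mapping_value = (mapping_hadoop_connection or "").strip()
    task_value = (task_cluster_identifier or "").strip()

    if not mapping_value:
        raise ValueError("mapping Hadoop connection property is empty")

    if task_value == AUTO_DEPLOY:
        if mapping_value == AUTO_DEPLOY:
            # Both properties set to Auto Deploy: temporary connections are generated.
            return "workflow cluster: temporary Hadoop connections"
        # Named connection in the mapping, Auto Deploy in the task: the task wins.
        return "workflow cluster: Cluster Identifier overrides mapping connection"

    if task_value == "" and mapping_value != AUTO_DEPLOY:
        # Blank Cluster Identifier: the named Hadoop connection decides the cluster.
        return f"other cluster: defined by Hadoop connection {mapping_value!r}"

    # Any other combination is not described in this topic.
    raise ValueError("combination not described in this topic")


if __name__ == "__main__":
    # "my_hadoop_conn" is a made-up connection name.
    print(resolve_run_target("Auto Deploy", "Auto Deploy"))
    print(resolve_run_target("my_hadoop_conn", "Auto Deploy"))
    print(resolve_run_target("my_hadoop_conn", ""))
```

A check like this could be used when reviewing a workflow design to confirm that each Mapping task lands on the intended cluster before deployment.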
