Developer Workflow Guide

Inclusive Gateway Example

Add Inclusive gateways to a workflow to run multiple tasks in parallel. For example, you can configure an Inclusive gateway with multiple branches and add a Mapping task to each branch. When the workflow runs, the Data Integration Service concurrently runs the mappings that the tasks identify.
You might configure multiple Mapping tasks in parallel when you need to process very large volumes of data. The tasks might identify mappings that read data from different locations and write the data to a warehouse. Add the Mapping tasks to the branches that the Inclusive gateways define. The Data Integration Service treats each mapping as a separate job. If you configure the Data Integration Service to run on a grid, the Data Integration Service can assign the mapping jobs to different nodes on the grid. If you configure the mappings to run in a Hadoop environment, the Data Integration Service pushes the mappings to the Hadoop cluster.
The following image shows a workflow that contains multiple Mapping tasks between Inclusive gateways:
The workflow defines multiple branches. All but one of the branches contain a Mapping task and a Notification task. The Notification task reports that the Mapping task ran. The default sequence flow starts a branch that contains a Notification task. A broken line in the sequence flow to the Notification task indicates that it is the default sequence flow.
The workflow contains the following objects:
  1. Start event. The Start event starts the workflow.
  2. Inclusive gateways. The two gateways split the workflow into multiple branches and merge the branches into a single flow.
  3. Mapping tasks. The workflow branches include a series of Mapping tasks. Each task identifies a mapping that reads a different data source. The mappings write to a common data target.
  4. Notification tasks. Each branch includes a Notification task that sends an email to the data owners when the corresponding mapping runs.
    The default sequence flow does not connect to a Mapping task. The sequence flow connects to a Notification task that notifies the workflow owner that the Data Integration Service reached the outgoing gateway.
  5. Mapping task. The final Mapping task includes an Expression transformation that verifies the number of records that the earlier mappings wrote to the target database tables.
  6. Notification task. The final Notification task sends an email to the workflow developer to indicate that the workflow tasks are complete.
  7. End event. The End event ends the workflow.
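The shape of this workflow can be sketched in a few lines of Python. This is only an illustrative model, not Data Integration Service code: the functions `run_mapping`, `notify`, `mapping_branch`, `default_branch`, and `run_workflow` are hypothetical names, the row counts are fabricated placeholders, and a thread pool stands in for the parallel branches. The sketch does not model sequence flow conditions; it assumes that every branch it is given runs.

```python
from concurrent.futures import ThreadPoolExecutor


def run_mapping(name):
    # Hypothetical stand-in for a Mapping task; the row count is a placeholder.
    return {"task": name, "rows_written": len(name)}


def notify(recipient, message):
    # Hypothetical stand-in for a Notification task; returns the message text.
    return f"to {recipient}: {message}"


def mapping_branch(mapping_name):
    # A branch that contains a Mapping task followed by a Notification task.
    result = run_mapping(mapping_name)
    note = notify("data owners", f"{mapping_name} ran")
    return {"mapping": result, "notification": note}


def default_branch():
    # The default branch contains only a Notification task.
    note = notify("workflow owner", "reached the outgoing gateway")
    return {"mapping": None, "notification": note}


def run_workflow(mapping_names):
    branches = [lambda n=n: mapping_branch(n) for n in mapping_names]
    branches.append(default_branch)
    # First gateway: start every branch concurrently.
    with ThreadPoolExecutor(max_workers=len(branches)) as pool:
        futures = [pool.submit(branch) for branch in branches]
        # Second gateway: wait for every branch before continuing.
        results = [future.result() for future in futures]
    # Final Mapping task: check the records that the earlier mappings wrote.
    total = sum(r["mapping"]["rows_written"] for r in results if r["mapping"])
    # Final Notification task: report that the workflow tasks are complete.
    final_note = notify("workflow developer", "workflow tasks are complete")
    return {"branches": results, "total_rows": total, "final": final_note}
```

Calling `run_workflow(["m_orders", "m_customers"])` returns three branch results (two mapping branches and the default branch), in the order in which the branches were submitted. The mapping names here are invented for the example.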
If a Mapping task includes a mapping that you configure to run in a Hadoop environment, do not assign the Mapping task outputs to workflow variables. Mappings that run in a Hadoop environment do not provide the total number of source, target, or error rows. When a Mapping task includes a mapping that runs in a Hadoop environment, the task outputs contain a value of zero (0).
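One way to picture the consequence of this restriction is a small guard that treats row counts from a Hadoop-run mapping as unavailable rather than as a real zero. The following Python sketch is an assumption-laden illustration, not product code: the function `usable_row_counts`, the `runs_on_hadoop` flag, and the key names `source_rows`, `target_rows`, and `error_rows` are all hypothetical.

```python
ROW_COUNT_KEYS = ("source_rows", "target_rows", "error_rows")  # hypothetical names


def usable_row_counts(task_output, runs_on_hadoop):
    """Return the row counts only when they carry real information."""
    if runs_on_hadoop:
        # These outputs are always zero for Hadoop-run mappings,
        # so report them as unavailable instead of as a count of zero.
        return None
    return {key: task_output[key] for key in ROW_COUNT_KEYS}
```

A downstream check, such as the record verification in the final Mapping task, could then skip any branch for which the function returns `None` instead of concluding that no rows were processed.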
