You can use new workflow tasks to create a cluster workflow.
A cluster workflow creates a cluster on a cloud platform and runs Mapping and other workflow tasks on the cluster. You can choose to terminate and delete the cluster when workflow tasks are complete to save cluster resources.
Two new workflow tasks enable you to create and delete a Hadoop cluster as part of a cluster workflow:
Create Cluster Task
The Create Cluster task enables you to create, configure, and start a Hadoop cluster on the following cloud platforms:
Amazon Web Services (AWS). You can create an Amazon EMR cluster.
Microsoft Azure. You can create an HDInsight cluster.
Delete Cluster Task
The optional Delete Cluster task enables you to delete a cluster after Mapping tasks and any other tasks in the workflow are complete. You might want to do this to save costs.
Previously, you could use Command tasks in a workflow to create clusters on a cloud platform. For more information about cluster workflows and workflow tasks, see the Informatica 10.2.1 Developer Workflow Guide.
In 10.2.1, the Command task method of creating and deleting clusters now supports Cloudera Altus clusters on AWS. For more information, see the article "How to Create Cloudera Altus Clusters with a Cluster Workflow on Big Data Management" on the Informatica Network.
Mapping task advanced properties include a new ClusterIdentifier property. The ClusterIdentifier property identifies the cluster on which the Mapping task runs.
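The ordering that a cluster workflow implies (create the cluster, run Mapping tasks against it, then optionally delete it) can be illustrated with a small conceptual model. The following Python sketch is not Informatica code and does not call any Informatica API. The Task class, the check_cluster_workflow function, the task names, and the convention that a ClusterIdentifier value matches the name of a Create Cluster task are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Task:
    """Hypothetical stand-in for a workflow task; not an Informatica type."""
    name: str
    kind: str  # "create_cluster", "mapping", or "delete_cluster"
    # Assumption: the value names the Create Cluster task that made the cluster.
    cluster_identifier: Optional[str] = None


def check_cluster_workflow(tasks: List[Task]) -> List[str]:
    """Walk the tasks in order and report Mapping tasks that have no running cluster."""
    problems = []
    running = set()  # clusters that have been created and not yet deleted
    for task in tasks:
        if task.kind == "create_cluster":
            running.add(task.name)
        elif task.kind == "mapping":
            if task.cluster_identifier not in running:
                problems.append(
                    f"{task.name}: cluster {task.cluster_identifier!r} is not running"
                )
        elif task.kind == "delete_cluster":
            running.discard(task.cluster_identifier)
    return problems


# Create Cluster task first, Mapping task next, optional Delete Cluster task last.
workflow = [
    Task("create_emr", "create_cluster"),
    Task("load_orders", "mapping", cluster_identifier="create_emr"),
    Task("delete_emr", "delete_cluster", cluster_identifier="create_emr"),
]
print(check_cluster_workflow(workflow))  # prints []
```

In this model, placing the Delete Cluster task before the Mapping task produces one reported problem, which mirrors why the Delete Cluster task belongs after the Mapping tasks and any other tasks in the workflow.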
For more information about cluster workflows, see the