Table of Contents


  1. Preface
  2. Part 1: Hadoop Integration
  3. Part 2: Databricks Integration
  4. Appendix A: Connections

Cloud Provisioning Configuration

Cloud Provisioning Configuration

The cloud provisioning configuration establishes a relationship between the Create Cluster task and the cluster connection that the workflows use to run mapping tasks. The Create Cluster task must include a reference to the cloud provisioning configuration. In turn, the cloud provisioning configuration points to the cluster connection that you create for use by the cluster workflow.
The properties to populate depend on the Hadoop distribution you choose to build a cluster on. Choose one of the following connection types:
  • AWS Cloud Provisioning. Connects to an Amazon EMR cluster on Amazon Web Services.
  • Azure Cloud Provisioning. Connects to an HDInsight cluster on the Azure platform.
  • Databricks Cloud Provisioning. Connects to a Databricks cluster on the Azure Databricks platform.


We’d like to hear from you!