Hadoop Integration Guide

Cloud Provisioning Configuration

The cloud provisioning configuration establishes a relationship between the Create Cluster task and the cluster connection that the workflows use to run mapping tasks. The Create Cluster task must include a reference to the cloud provisioning configuration. In turn, the cloud provisioning configuration points to the cluster connection that you create for use by the cluster workflow.
The properties that you populate depend on the Hadoop distribution on which you build the cluster. Choose one of the following connection types:
  • AWS Cloud Provisioning. Connects to an Amazon EMR cluster on Amazon Web Services.
  • Azure Cloud Provisioning. Connects to an HDInsight cluster on the Azure platform.
  • Databricks Cloud Provisioning. Connects to a Databricks cluster on the Azure Databricks platform.
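
The chain of references described above can be pictured with a small data model. The following Python sketch is illustrative only and is not product code: the class names, field names, and sample object names are all hypothetical and do not correspond to any Informatica API or property. It shows only that a Create Cluster task refers to a cloud provisioning configuration, which in turn points to the cluster connection that the workflow uses.

```python
from dataclasses import dataclass
from enum import Enum


class ProvisioningType(Enum):
    """The three connection types listed in this section."""
    AWS = "AWS Cloud Provisioning"                 # Amazon EMR on Amazon Web Services
    AZURE = "Azure Cloud Provisioning"             # HDInsight on the Azure platform
    DATABRICKS = "Databricks Cloud Provisioning"   # Azure Databricks


@dataclass(frozen=True)
class ClusterConnection:
    """Hypothetical stand-in for the cluster connection used by mapping tasks."""
    name: str


@dataclass(frozen=True)
class CloudProvisioningConfiguration:
    """Hypothetical stand-in: points to the cluster connection."""
    name: str
    provisioning_type: ProvisioningType
    cluster_connection: ClusterConnection


@dataclass(frozen=True)
class CreateClusterTask:
    """Hypothetical stand-in: must reference a cloud provisioning configuration."""
    name: str
    provisioning_config: CloudProvisioningConfiguration


def connection_for(task: CreateClusterTask) -> ClusterConnection:
    """Follow the references from the task to the cluster connection."""
    return task.provisioning_config.cluster_connection


# Example wiring with made-up names.
connection = ClusterConnection(name="emr_cluster_connection")
config = CloudProvisioningConfiguration(
    name="aws_provisioning",
    provisioning_type=ProvisioningType.AWS,
    cluster_connection=connection,
)
task = CreateClusterTask(name="create_emr_cluster", provisioning_config=config)

print(connection_for(task).name)
```

In this sketch the task never holds the cluster connection directly. It reaches the connection only through the configuration, which mirrors the relationship that the cloud provisioning configuration establishes.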
