Installation for Enterprise Data Preparation

Create the Cluster Configuration

Create the cluster configuration, which contains configuration information about the non-native environment. The cluster configuration enables the Data Integration Service to push jobs to the non-native environment. You must create a cluster configuration if you plan to use Enterprise Data Preparation.
You import configuration properties from the non-native environment to create a cluster configuration. You can import the properties from an archive file that the Hadoop administrator creates, or you can import the properties directly from the cluster.
When you create the cluster configuration, you can also choose to create Hadoop, HDFS, Hive, HBase, or Databricks connections to the non-native environment. The installer appends the connection type to the cluster configuration name to create each connection name.
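The naming rule above can be illustrated with a minimal Python sketch. It is not the installer's code: the helper name `connection_names`, the underscore separator, and the capitalization of the suffixes are assumptions made for illustration, since this section does not state the exact format the installer uses.

```python
# Illustrative sketch only. The "_" separator and the suffix spelling are
# assumptions; check the connection names the installer actually creates.
CONNECTION_TYPES = ("Hadoop", "HDFS", "Hive", "HBase", "Databricks")


def connection_names(cluster_config_name, connection_types=CONNECTION_TYPES, separator="_"):
    """Return a mapping of connection type to a derived connection name.

    Each name is the cluster configuration name with the connection type
    appended, which mirrors the rule described in the text.
    """
    return {
        conn_type: f"{cluster_config_name}{separator}{conn_type}"
        for conn_type in connection_types
    }


if __name__ == "__main__":
    # "edp_cluster" is a hypothetical cluster configuration name.
    for conn_type, name in connection_names("edp_cluster").items():
        print(f"{conn_type}: {name}")
```

Because every connection name derives from the cluster configuration name, choosing a short, descriptive configuration name in step 1 keeps the resulting connection names readable.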
  1. Enter the name of the cluster configuration to create.
  2. Specify the non-native distribution for the cluster.
    The following table describes the options you can specify:
    Prompt  Description
    1       Cloudera. You can create a cluster configuration for a Cloudera cluster on Cloudera Distribution Hadoop (CDH).
    2       Hortonworks
    3       Azure HDInsight
    4       MapR. You must import MapR cluster configuration properties from an archive file.
    5       Amazon EMR. You must import Amazon EMR cluster configuration properties from an archive file.
    6       Databricks
    7       Google Dataproc
  3. Import configuration properties from the non-native environment to create the cluster configuration.
    • To import the properties from an archive file, press 1. If you create a cluster configuration for an Amazon EMR, MapR, or Google Dataproc cluster, you must import the properties from an archive file.
    • To import the properties directly from the cluster, press 2.
  4. If you choose to import the properties from an archive file, specify the name of the configuration archive file and the path to the file.
  5. If you choose to import the properties directly from the cluster, specify the connection properties.
    The following table describes the Cloudera, Hortonworks, or Azure HDInsight cluster properties you specify:
    Property      Description
    Host          The host name or IP address of the cluster manager.
    Port          Port of the cluster manager.
    User ID       Cluster user name.
    Password      Password for the cluster user.
    Cluster Name  Name of the cluster. Use the display name if the cluster manager manages multiple clusters. If you do not provide a cluster name, the wizard imports information based on the default cluster.
    Engine type   If you specified a Cloudera cluster, the installer prompts for the engine type. If you are on a CDP cluster, accept the default engine type of Tez. If you are on a CDH cluster, press 2 to set the engine type to MRv2. Default is 1.
  6. To create the Hadoop, HDFS, Hive, HBase, or Databricks connections to the cluster, press 1.
    The installer appends the connection type to the cluster configuration name to create a connection name.
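The distribution prompt in step 2 and the import restriction in step 3 can be summarized in a short Python sketch. It only encodes what the steps state: the option numbers come from the distribution table, and Amazon EMR, MapR, and Google Dataproc require an archive file. The names `DISTRIBUTIONS`, `ARCHIVE_ONLY`, and `requires_archive_import` are hypothetical and are not part of the installer.

```python
# Illustrative sketch only; these names are not an Informatica API.
# Option numbers mirror the distribution prompt in step 2.
DISTRIBUTIONS = {
    1: "Cloudera",
    2: "Hortonworks",
    3: "Azure HDInsight",
    4: "MapR",
    5: "Amazon EMR",
    6: "Databricks",
    7: "Google Dataproc",
}

# Distributions that step 3 says must be imported from an archive file.
ARCHIVE_ONLY = {"MapR", "Amazon EMR", "Google Dataproc"}


def requires_archive_import(option):
    """Return True if the chosen distribution must use an archive file."""
    if option not in DISTRIBUTIONS:
        raise ValueError(f"Unknown distribution option: {option}")
    return DISTRIBUTIONS[option] in ARCHIVE_ONLY


if __name__ == "__main__":
    for option, name in DISTRIBUTIONS.items():
        note = "archive file required" if requires_archive_import(option) else "no archive-only restriction stated"
        print(f"{option}. {name}: {note}")
```

A check like this can be useful when you prepare installation inputs ahead of time, because it shows whether you need to request a configuration archive from the Hadoop administrator before you run the installer.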