Table of Contents

Search

  1. Installation Getting Started
  2. Before You Install the Services
  3. Run the Big Data Suite Installer
  4. After You Install the Services
  5. Install the Developer Tool
  6. Uninstallation
  7. Starting and Stopping Informatica Services
  8. Connecting to Databases
  9. Updating the DynamicSections Parameter of a DB2 Database
  10. Silent Input Properties File

Installation and Configuration Guide

Installation and Configuration Guide

Create the Cluster Configuration

Create the Cluster Configuration

Create the cluster configuration, which contains configuration information about the Hadoop cluster. The cluster configuration enables the Data Integration Service to push jobs to the Hadoop environment.
You import configuration properties from the Hadoop cluster to create a cluster configuration. You can import the properties from an archive file that the Hadoop administrator creates, or you can import the properties directly from the cluster.
When you create the cluster configuration, you can also choose to create Hadoop, Hive, HBase, and HDFS connections to the Hadoop environment. If you want the installer to create and enable the Enterprise Data Lake services, you must create the connections.
  1. Enter the name of the cluster configuration to create.
  2. Specify the Hadoop distribution for the cluster.
    The following table describes the options you can specify:
    Option
    Description
    1
    Select to create a cluster configuration for a Cloudera cluster.
    2
    Select to create a cluster configuration for a Hortonworks cluster.
    3
    Select to create a cluster configuration for a Azure HDInsight cluster.
    4
    Select to create a cluster configuration for a MapR cluster. You must import the MapR cluster configuration properties from an archive file.
    5
    Select to create a cluster configuration for an Amazon EMR cluster. You must import the Amazon EMR cluster configuration properties from an archive file.
  3. Import configuration properties from the Hadoop cluster to create the cluster configuration.
    • To import the properties from an archive file, press
      1
      . If you create a cluster configuration for an Amazon EMR cluster or for a MapR cluster, you must import the properties from an archive file.
    • To import the properties directly from the cluster, press
      2
      .
  4. If you choose to import the properties directly from the cluster, specify the connection properties.
    The following table describes the properties you specify:
    Property
    Description
    Host
    The host name or IP address of the cluster manager.
    Port
    Port of the cluster manager.
    User ID
    Cluster user name.
    Password
    Password for the cluster user.
    Cluster Name
    Name of the cluster. Use the display name if the cluster manager manages multiple clusters. If you do not provide a cluster name, the wizard imports information based on the default cluster.
  5. To create the Hadoop, Hive, HDFS, and HBase connections associated with the cluster, press
    1
    .
The Data Preparation Repository Database section appears.