Table of Contents

Search

  1. Preface
  2. Part 1: Installation Getting Started
  3. Part 2: Before You Install the Services
  4. Part 3: Run the Services Installer
  5. Part 4: After You Install the Services
  6. Part 5: Informatica Client Installation
  7. Part 6: Uninstallation
  8. Appendix A: Starting and Stopping Informatica Services
  9. Appendix B: Connecting to Databases from UNIX or Linux
  10. Appendix C: Updating the DynamicSections Parameter of a DB2 Database

Installation for Data Engineering

Installation for Data Engineering

Cluster Configuration

Cluster Configuration

You import configuration properties from the non-native cluster to create a cluster configuration. The cluster configuration enables the Data Integration Service to push jobs to the non-native environment.
You can import the properties from an archive file that the Hadoop administrator creates, or you can import the properties directly from the cluster. When you create the cluster configuration, you can also choose to create Hadoop, Hive, HBase, HDFS, or Databricks connections associated with the cluster. The installer appends the connection type to the cluster configuration name to create each connection name.
The following table describes the initial information that you need to enter during the installation process:
Cluster Information
Description
Cluster configuration name
Name of the cluster configuration to create.
Distribution type
Type of non-native cluster distribution.
Cluster configuration import method
Method to import the cluster configuration. You can choose to import the cluster configuration from an archive file or from the cluster.

Import Cluster Configuration from an Archive File

To import the cluster configuration properties from an archive file, specify the path of the configuration archive file.

Import Cluster Configuration from the Cluster

The following table describes the cluster properties for Cloudera, Hortonworks, or Azure HDInsight that you need to enter when you import from cluster during the installation process:
Property
Description
Host
The host name or IP address of the cluster manager.
Port
Port of the cluster manager.
User ID
Cluster user name.
Password
Password for the cluster user.
Cluster Name
Name of the cluster. Use the display name if the cluster manager manages multiple clusters. If you do not provide a cluster name, the wizard imports information based on the default cluster.
Engine type
If you specified a Cloudera cluster, the installer prompts for the engine type.
If you are on a CDP cluster, accept the default engine type of Tez. If you are on a CDH cluster, set the engine type to MRv2.
The following table describes the cluster properties for Databricks that you need to enter during the installation process:
Property
Description
Databricks domain
URL of the Databricks cluster.
Databricks token ID
Token ID of the Databricks cluster.
Databricks cluster ID
Cluster ID of the Databricks cluster.