
Data Engineering Administrator Guide

Importing a Hadoop Cluster Configuration from the Cluster

When you import the Hadoop cluster configuration directly from the cluster, you provide information to connect to the cluster.
Get cluster connection information from the Hadoop administrator.
  1. From the Connections tab, click the ClusterConfigurations node in the Domain Navigator.
  2. From the Actions menu, select New > Cluster Configuration.
    The Cluster Configuration wizard opens.
  3. Configure the following General properties:
    Property: Description
    Cluster configuration name: Name of the cluster configuration.
    Description: Optional description of the cluster configuration.
    Distribution type: The cluster Hadoop distribution type.
    Method to import the cluster configuration: Choose Import from cluster.
    Create connections: Choose to create Hadoop, HDFS, Hive, and HBase connections.
      If you choose to create connections, the Cluster Configuration wizard associates the cluster configuration with each connection that it creates.
      The Hadoop connection contains default values for properties such as cluster environment variables, cluster path variables, and advanced properties. Based on the cluster environment and the functionality that you use, you can add to the default values or change the default values of these properties. For a list of Hadoop connection properties to configure, see Hadoop Cluster Connection Properties.
      If you do not choose to create connections, you must manually create them and associate the cluster configuration with them.
      When the wizard creates the Hive connection, it populates the Metadata Connection String and the Data Access Connection String properties with the value from the hive.metastore.uris property. If the Hive metastore and HiveServer2 are running on different nodes, you must update the Metadata Connection String to point to the HiveServer2 host.
    The cluster properties appear.
  4. Configure the following properties:
    Property: Description
    Host: Host name or IP address of the cluster manager.
    Port: Port of the cluster manager.
    User ID: Cluster user ID.
    Password: Password for the user.
    Cluster name: Name of the cluster. Use the display name if the cluster manager manages multiple clusters. If you do not provide a cluster name, the wizard imports information based on the default cluster.
  5. Click Next and verify the cluster configuration information on the summary page.
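
The note in step 3 about the Hive connection can be checked with a small script. The following is a minimal sketch, not part of the product: it assumes that the Metadata Connection String takes the usual HiveServer2 JDBC form (jdbc:hive2://host:port/database) and that HiveServer2 listens on the stock Hive default port 10000. The function names and the example.com host names are hypothetical. It parses a hive.metastore.uris value, reports whether HiveServer2 runs on a different host than the metastore, and builds a candidate connection string to enter manually.

```python
from urllib.parse import urlparse

# Stock Hive default for HiveServer2; an assumption, confirm with the Hadoop administrator.
HIVESERVER2_DEFAULT_PORT = 10000


def metastore_hosts(metastore_uris):
    """Return the host names found in a hive.metastore.uris value.

    The property may hold one URI or a comma-separated list,
    for example "thrift://ms1:9083,thrift://ms2:9083".
    """
    hosts = []
    for uri in metastore_uris.split(","):
        uri = uri.strip()
        if not uri:
            continue
        hostname = urlparse(uri).hostname  # lowercased by urlparse
        if hostname:
            hosts.append(hostname)
    return hosts


def needs_update(metastore_uris, hs2_host):
    """True if HiveServer2 runs on a host that is not a metastore host."""
    return hs2_host.lower() not in metastore_hosts(metastore_uris)


def hiveserver2_jdbc_url(hs2_host, port=HIVESERVER2_DEFAULT_PORT, database="default"):
    """Build a HiveServer2 JDBC URL (assumed format for the connection string)."""
    return f"jdbc:hive2://{hs2_host}:{port}/{database}"


if __name__ == "__main__":
    uris = "thrift://ms1.example.com:9083"  # hypothetical metastore URI
    hs2 = "hs2.example.com"                 # hypothetical HiveServer2 host
    if needs_update(uris, hs2):
        print("Update the Metadata Connection String to:", hiveserver2_jdbc_url(hs2))
```

If needs_update returns False, the metastore and HiveServer2 share a host, and only the protocol and port of the populated value may need review.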
