Table of Contents

Search

  1. Preface
  2. Introduction to Data Engineering Administration
  3. Authentication
  4. Running Mappings on a Cluster with Kerberos Authentication
  5. Authorization
  6. Cluster Configuration
  7. Cloud Provisioning Configuration
  8. Data Integration Service Processing
  9. Appendix A: Connections Reference
  10. Appendix B: Monitoring REST API

Data Engineering Administrator Guide

Data Engineering Administrator Guide

Import the Cluster Configuration

Import the Cluster Configuration

After you create the .xml file with the cluster properties, use the Administrator tool to import into the domain and create the cluster configuration.
  1. From the
    Connections
    tab, click the
    ClusterConfigurations
    node in the Domain Navigator.
  2. From the Actions menu, select
    New
    Cluster Configuration
    .
    The
    Cluster Configuration
    wizard opens.
  3. Configure the following properties:
    Property
    Description
    Cluster configuration name
    Name of the cluster configuration.
    Description
    Optional description of the cluster configuration.
    Distribution type
    The distribution type. Choose
    Databricks
    .
    Method to import the cluster configuration
    Choose
    Import from file
    .
    Upload configuration archive file
    The full path and file name of the file. Click the Browse button to navigate to the file.
    Create connection
    Choose to create a Databricks connection.
    If you choose to create a connection, the
    Cluster Configuration
    wizard associates the cluster configuration with the Databricks connection.
    If you do not choose to create a connection, you must manually create one and associate the cluster configuration with it.
  4. Click
    Next
    to verify the information on the summary page.

Dataproc version 2.0 clusters

When you create a cluster configuration for a Google Dataproc cluster, by default the cluster configuration is created for Dataproc version 1.4. To integrate with Dataproc 2.x clusters, you must manually update the cluster configuration version property to 2.0.
You only need to perform this workaround for Informatica versions 10.5.1x.
  1. In the Administrator tool, click Connections.
  2. Expand the Cluster Configurations node in the Domain Navigator and select the Dataproc cluster configuration.
  3. Edit the Distribution Version property of the Dataproc cluster configuration. Change the property value to
    2.0
    .
  4. Save the changes and restart .

0 COMMENTS

We’d like to hear from you!