Installation and Configuration Guide

Create the Data Preparation Service

When you install Enterprise Data Lake on the master gateway node for the domain, you can create the Enterprise Data Lake Service and the Data Preparation Service during installation.
If you do not create the Enterprise Data Lake Service and the Data Preparation Service during installation, or if you install Enterprise Data Lake on another gateway node in the domain, you can use the Administrator tool to create the services after you install the Enterprise Data Lake binaries.
  1. Specify the name of the Data Preparation Service.
    The name is not case sensitive and must be unique within the domain. It cannot exceed 128 characters or begin with @. It also cannot contain spaces or the following special characters: ` ~ % ^ * + = { } \ ; : ' " / ? . , < > | ! ( ) ] [
  2. If you plan to use rules, you must associate a Model Repository Service and a Data Integration Service with the Data Preparation Service.
    • To skip associating a Model Repository Service and a Data Integration Service with the Data Preparation Service, press 1.
    • To associate a Model Repository Service and a Data Integration Service with the Data Preparation Service, press 2, and then enter the service names.
  3. To create the Data Preparation Service during installation, enter the name of the current node.
    If you do not want to create the service during installation, do not enter a value. You can use the Administrator tool to create the service after installation.
    If you create the Enterprise Data Lake Service and the Data Preparation Service during installation, you must create both services on the same node.
  4. Choose whether to enable secure communication for the Data Preparation Service.
    • To enable secure communication for the Data Preparation Service, press 1.
    • To disable secure communication, press 2.
  5. If you enable secure communication for the service, select the SSL certificate to use.
    • To use the default Informatica SSL certificate contained in the default keystore and the default truststore, press 1.
    • To use a custom SSL certificate contained in a custom keystore and truststore, press 2, and then enter the path and file name for the keystore and truststore files. You must also enter the keystore and truststore passwords.
  6. Enter the port number for the service. If you enabled secure communication, enter the port number for the HTTPS connection. If you disabled secure communication, enter the port number for the HTTP connection.
  7. Select the Hadoop authentication mode.
    • To select the non-secure authentication mode, press 1.
    • To select Kerberos authentication, press 2.
  8. If you select Kerberos, enter the authentication parameters.
    The following table describes the authentication parameters that you must set if you select Kerberos:
    | Property | Description |
    | --- | --- |
    | HDFS Principal Name | Service Principal Name (SPN) for the data preparation Hadoop cluster. Specify the service principal name in the following format: user/_HOST@REALM. |
    | Hadoop Impersonation User Name | User name to use in Hadoop impersonation as shown in the Impersonation User Name property for the Hadoop connection in the Administrator tool. If the Hadoop cluster uses Kerberos authentication, the Hadoop impersonation user must have read, write, and execute permissions on the HDFS storage location folder. |
    | Kerberos Keytab File | Path and file name of the SPN keytab file for the user account to impersonate when connecting to the Hadoop cluster. The keytab file must be in a directory on the machine where the Data Preparation Service runs. |
  9. Specify the HDFS storage location, HDFS connection, local storage location, and Solr port number details.
    The following table describes the properties you must set:
    | Property | Description |
    | --- | --- |
    | HDFS Storage Location | HDFS location for data preparation file storage. If the Hadoop cluster uses Kerberos authentication, the Hadoop impersonation user must have read, write, and execute permissions on the HDFS storage location folder. |
    | HDFS Connection | HDFS connection for data preparation file storage. |
    | Local Storage Location | Directory for data preparation file storage on the node on which the Data Preparation Service runs. If the connection to the local storage fails, the Data Preparation Service recovers data preparation files from the HDFS storage location. |
    | Solr Port | Solr port number for the Apache Solr server used to provide data preparation recommendations. |
  10. Choose whether to enable the Data Preparation Service.
    • To enable the service at a later time using the Administrator tool, press 1.
    • To enable the service after you complete the installation process, press 2.
The Enterprise Data Lake Service section appears.
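
Before you run the installer, it can help to check a proposed service name against the naming rules in step 1. The following Python sketch is not part of the Informatica product. The function names service_name_problems and names_conflict are hypothetical, and the check for an empty name is an assumption that the guide does not state. The sketch encodes only the rules listed in step 1: a maximum of 128 characters, no leading @, no spaces, none of the listed special characters, and case-insensitive uniqueness.

```python
# Characters that step 1 lists as not allowed in a service name, plus the space.
FORBIDDEN_CHARS = set("`~%^*+={}\\;:'\"/?.,<>|!()][ ")
MAX_NAME_LENGTH = 128


def service_name_problems(name: str) -> list[str]:
    """Return the naming rules from step 1 that the proposed name violates."""
    problems = []
    if not name:
        # Assumption: the guide does not mention empty names explicitly.
        problems.append("name is empty")
    if len(name) > MAX_NAME_LENGTH:
        problems.append(f"name exceeds {MAX_NAME_LENGTH} characters")
    if name.startswith("@"):
        problems.append("name begins with @")
    bad = sorted({ch for ch in name if ch in FORBIDDEN_CHARS})
    if bad:
        problems.append("name contains disallowed characters: " + " ".join(repr(ch) for ch in bad))
    return problems


def names_conflict(name: str, existing_name: str) -> bool:
    """Service names are not case sensitive, so compare them case-insensitively."""
    return name.lower() == existing_name.lower()


if __name__ == "__main__":
    # Hypothetical example names, used only for illustration.
    for candidate in ["DPS_Prod", "@dps", "data prep service"]:
        print(candidate, "->", service_name_problems(candidate) or "ok")
```

An empty list means that the name passes the checks in this sketch. The installer and the Administrator tool remain the authority on whether a name is accepted, and uniqueness must still be checked against the services that exist in the domain.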