Installation and Configuration Guide


Cluster Information for Enterprise Data Catalog


When you install Enterprise Data Catalog, you must provide information about the cluster that Enterprise Data Catalog uses.
The following table describes cluster information to provide during installation:
Domain Information
Description
SSH username
User name for the password-less Secure Shell (SSH) connection.
Ambari server host
Host information for the Ambari server. Ambari is a web-based tool for provisioning, managing, and monitoring Apache Hadoop clusters, which includes support for Hadoop HDFS, Hadoop MapReduce, Hive, HBase and ZooKeeper.
Comma-separated Ambari agent hosts
Applies to high availability. If you use multiple Ambari agent hosts, specify the host names as a comma-separated list.
Ambari web port
Port number on which the Ambari server runs.
Keytab Location
Applies to a Kerberos-enabled cluster. Location of the merged user and host keytab file.
Kerberos configuration file
Applies to a Kerberos-enabled cluster. Location of the Kerberos configuration file.
YARN resource manager URI
URI of the YARN resource manager, the service within Hadoop that submits MapReduce tasks to specific nodes in the cluster.
Use the following format: <host name>:<port>
  • <host name> is the host name or IP address of the YARN resource manager.
  • <port> is the port on which the YARN resource manager listens for Remote Procedure Calls (RPC).
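As an illustration of the <host name>:<port> format, the following minimal Python sketch checks a value before you enter it in the installer. The function name parse_host_port and the sample host rm.example.com are hypothetical and are not part of the product.

```python
from typing import Tuple


def parse_host_port(value: str) -> Tuple[str, int]:
    """Split a '<host name>:<port>' string into (host, port).

    Hypothetical helper for illustration only; it is not an Informatica API.
    """
    # Split on the last colon so the port is always the final component.
    host, sep, port_text = value.strip().rpartition(":")
    if not sep or not host:
        raise ValueError(f"expected <host name>:<port>, got {value!r}")
    if not port_text.isdecimal():
        raise ValueError(f"port must be numeric, got {port_text!r}")
    port = int(port_text)
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return host, port


# Example values only; use the host and RPC port of your own resource manager.
host, port = parse_host_port("rm.example.com:8032")
```

The sketch only validates the shape of the value. It does not verify that the resource manager is reachable on that host and port.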
YARN resource manager http URI
The HTTP URI value for the YARN resource manager.
YARN resource manager scheduler URI
Scheduler URI value for the YARN resource manager.
ZooKeeper URI
The URI for the ZooKeeper service, which is a high-performance coordination service for distributed applications.
HDFS namenode URI
The URI to access HDFS.
Use the following format to specify the NameNode URI in the Cloudera distribution: hdfs://<name node>:<port>
  • <name node> is the host name or IP address of the NameNode.
  • <port> is the port on which the NameNode listens for Remote Procedure Calls (RPC).
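The hdfs://<name node>:<port> format can be checked with the Python standard library. The sketch below is an assumption-laden illustration: parse_namenode_uri and the sample host nn.example.com are hypothetical names, and the port shown is only an example value.

```python
from typing import Tuple
from urllib.parse import urlsplit


def parse_namenode_uri(uri: str) -> Tuple[str, int]:
    """Return (name node host, port) from an 'hdfs://<name node>:<port>' URI.

    Hypothetical helper for illustration only; it is not an Informatica API.
    """
    parts = urlsplit(uri.strip())
    if parts.scheme != "hdfs":
        raise ValueError(f"expected the hdfs:// scheme, got {uri!r}")
    # parts.port raises ValueError itself if the port is not a valid number.
    port = parts.port
    if not parts.hostname or port is None:
        raise ValueError(f"expected hdfs://<name node>:<port>, got {uri!r}")
    return parts.hostname, port


# Example values only; substitute the NameNode host and RPC port of your cluster.
name_node, port = parse_namenode_uri("hdfs://nn.example.com:8020")
```

Note that urlsplit returns the host name in lowercase, so compare host names case-insensitively if you reuse this check.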
Service cluster name
Name of the service cluster. Ensure that the directory /Informatica/LDM/<ServiceClusterName> exists in HDFS before the installation is complete.
If you do not specify a service cluster name, Enterprise Data Catalog uses <DomainName>_<CatalogServiceName> as the default value. In that case, the directory /Informatica/LDM/<DomainName>_<CatalogServiceName> must exist in HDFS.
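The directory naming rule for the service cluster can be sketched as follows. The function ldm_directory and the sample names Domain1, CatalogSvc, and cluster1 are hypothetical and serve only to show how the default name is composed.

```python
from typing import Optional


def ldm_directory(domain_name: str,
                  catalog_service_name: str,
                  service_cluster_name: Optional[str] = None) -> str:
    """Return the HDFS directory that is expected to exist for the service cluster.

    Hypothetical helper for illustration only; it is not an Informatica API.
    """
    # Fall back to <DomainName>_<CatalogServiceName> when no name is given.
    name = service_cluster_name or f"{domain_name}_{catalog_service_name}"
    return f"/Informatica/LDM/{name}"


# Example values only.
default_dir = ldm_directory("Domain1", "CatalogSvc")
named_dir = ldm_directory("Domain1", "CatalogSvc", "cluster1")
```

With the resulting path in hand, one way to create the directory is the standard Hadoop command hdfs dfs -mkdir -p followed by the path, run as a user with the required HDFS permissions.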
History Server HTTP URI
HTTP URI to access the history server.