Installation and Configuration Guide

Embedded Cluster Node Management

A Hadoop cluster is a set of machines configured to run Hadoop applications and services. A typical Hadoop cluster includes a master node and multiple slave (worker) nodes. The master node runs the master daemons, JobTracker and NameNode. Each slave node runs the DataNode and TaskTracker daemons. In small clusters, the master node might also run the slave daemons.

Cluster with High Availability

You can use the high availability option for the HDFS, HBase, YARN, and Solr components of the embedded Hadoop cluster environment. If you set up Informatica Cluster Service on a multi-node, highly available cluster, you need a minimum of three nodes for Enterprise Data Catalog to function successfully. If you have already set up Informatica Cluster Service on a single node, you cannot make the cluster highly available by adding more nodes to the cluster.
If the embedded cluster contains exactly three nodes, Enterprise Data Catalog distributes all master and slave services across all three nodes. If the embedded cluster contains more than three nodes, Enterprise Data Catalog automatically chooses the three nodes with the highest system configuration as master nodes. The remaining nodes serve as slave nodes. Any nodes that you add to the embedded cluster later also serve as slave nodes, so they must meet the minimum configuration requirements for slave nodes.
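
The role distribution described above can be sketched in a few lines of Python. This is an illustration only, not the logic that Enterprise Data Catalog actually runs. The `Node` class, the `assign_roles` function, and the ranking by CPU cores and then memory are assumptions made for the example, because the guide does not state how "highest system configuration" is measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical description of a cluster node (not a product API)."""
    name: str
    cpu_cores: int
    memory_gb: int


def assign_roles(nodes):
    """Return a dict that maps each node name to its set of roles.

    Assumption: "highest system configuration" is approximated by
    ranking nodes on CPU cores first and memory second.
    """
    if len(nodes) < 3:
        raise ValueError("a highly available cluster needs at least three nodes")

    # Rank nodes from strongest to weakest and take the top three as masters.
    ranked = sorted(nodes, key=lambda n: (n.cpu_cores, n.memory_gb), reverse=True)
    masters = {n.name for n in ranked[:3]}

    roles = {}
    for node in nodes:
        if len(nodes) == 3:
            # Exactly three nodes: every node runs master and slave services.
            roles[node.name] = {"master", "slave"}
        elif node.name in masters:
            roles[node.name] = {"master"}
        else:
            roles[node.name] = {"slave"}
    return roles


if __name__ == "__main__":
    cluster = [
        Node("a", 8, 32),
        Node("b", 16, 64),
        Node("c", 4, 16),
        Node("d", 16, 128),
        Node("e", 8, 64),
    ]
    for name, node_roles in assign_roles(cluster).items():
        print(name, sorted(node_roles))
```

In this sketch, a node added after the initial setup would always fall outside the master set, which mirrors the rule that newly added nodes serve as slave nodes.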

Cluster without High Availability

You can set up Informatica Cluster Service on a single node that is not highly available. In this case, the master and worker services run on the same node. A two-node cluster is not supported: Informatica Cluster Service does not start if you add a node to an existing single-node cluster or if you try to set up the service with two nodes.
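
The supported node counts for an initial setup can be summarized as a small check. The sketch below is hypothetical: the function name `deployment_mode` and the returned labels are invented for illustration, and it assumes that every cluster with three or more nodes is set up as highly available.

```python
def deployment_mode(node_count):
    """Classify an initial cluster setup by its node count.

    Assumption: a cluster with three or more nodes is configured
    for high availability.
    """
    if node_count < 1:
        raise ValueError("a cluster needs at least one node")
    if node_count == 1:
        return "single-node"
    if node_count == 2:
        # Informatica Cluster Service does not start with two nodes.
        return "unsupported"
    return "high-availability"


if __name__ == "__main__":
    for count in (1, 2, 3, 5):
        print(count, deployment_mode(count))
```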

Delete Nodes

You can delete nodes from the embedded cluster, subject to the following restrictions:
  • You cannot delete a master node.
  • You cannot delete a node if the number of live data nodes in the cluster becomes less than three on deleting the node.
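
The two restrictions above can be expressed as a pre-check before a node is removed. This is a minimal sketch under stated assumptions: `can_delete_node` is a hypothetical helper, not a product command, and the caller is assumed to already know which nodes are masters and which nodes currently host live data nodes.

```python
def can_delete_node(node, master_nodes, live_data_nodes):
    """Return True if deleting `node` satisfies both restrictions.

    node            -- name of the node to delete
    master_nodes    -- set of names of the master nodes
    live_data_nodes -- set of names of nodes with a live data node
    """
    # Restriction 1: a master node can never be deleted.
    if node in master_nodes:
        return False

    # Restriction 2: at least three live data nodes must remain.
    remaining = len(live_data_nodes - {node})
    return remaining >= 3


if __name__ == "__main__":
    masters = {"m1", "m2", "m3"}
    live = {"m1", "m2", "m3", "w1"}
    print(can_delete_node("w1", masters, live))
    print(can_delete_node("m1", masters, live))
```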