Cluster with High Availability
You can use the highly availability option for the HDFS, HBase, YARN, and Solr components of the embedded Hadoop cluster environment. If you set up Informatica Cluster Service on a multi-node and highly available cluster, you need a minimum of three nodes for Enterprise Data Catalog to function successfully. If you have already set up Informatica Cluster Service on a single node, you cannot make the cluster highly available by adding more nodes to the cluster.
If the embedded cluster contains only three nodes, Enterprise Data Catalog distributes all master and slave services on all the three nodes. If the embedded cluster contains more than three nodes, Enterprise Data Catalog automatically chooses top three nodes with the highest system configuration as master nodes. The remaining nodes serve as slave nodes. When you add nodes to the embedded cluster, the newly added nodes serve as slave nodes. The nodes that you add to the cluster must meet the minimum configuration requirements for slave nodes.