Configuring YARN in Informatica Big Data Management®

Example. Queues and Node Labeling on the Blaze Engine

You can configure the Blaze engine to submit Blaze jobs to a YARN queue that is configured with a set of node labels. At the same time, you can configure a node label for the Blaze engine so that the engine starts only on the cluster nodes that have that label. The node label that you configure for the Blaze engine must be accessible to the YARN queue that receives the submitted Blaze jobs.
For example, the Hadoop cluster might be divided into the following partitions:
  • memory. Partition of nodes that have higher memory.
  • fast. Partition of nodes that have higher CPU.
  • default. Default partition.
You configure a YARN queue Blaze_only using the node labels fast and default. When you configure the Blaze engine, you specify the queue Blaze_only and the node label fast. Note that the YARN queue can access the node label fast.
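The accessibility rule can be illustrated with a minimal sketch. This is a hypothetical model of the example, not an Informatica or YARN API: the dictionary QUEUE_NODE_LABELS and the function blaze_label_is_accessible are names assumed here for illustration only.

```python
# Hypothetical model of the example above; not an Informatica or YARN API.
# Maps each YARN queue to the node labels it is configured to access.
QUEUE_NODE_LABELS = {"Blaze_only": {"fast", "default"}}


def blaze_label_is_accessible(queue, node_label, queue_labels=QUEUE_NODE_LABELS):
    """Return True if the queue can access the node label set for the Blaze engine."""
    return node_label in queue_labels.get(queue, set())


# The configuration from the example: queue Blaze_only, node label fast.
valid = blaze_label_is_accessible("Blaze_only", "fast")

# A label that the queue cannot access, such as memory, would not satisfy the rule.
invalid = blaze_label_is_accessible("Blaze_only", "memory")
```

In this sketch, valid is True because fast is one of the labels of Blaze_only, and invalid is False because the queue is not configured with the memory label.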
The following image shows the configurations for the Blaze engine in the Hadoop connection:
This image shows the Blaze configuration properties in the Hadoop connection. The value Blaze_only is set for the property YARN Queue Name and the value fast is set for the Blaze YARN Node Label.
The following image shows the nodes where the Blaze engine starts and the nodes that are accessible to the queue:
This image shows the nodes in the Hadoop cluster. The Blaze engine starts on the nodes in the fast partition. The jobs in the queue have access to two sub-clusters of nodes: the fast partition and the default partition.
  1. The Blaze engine starts on the nodes in the fast partition.
  2. The queue can access the fast and the default partitions. The jobs in the queue run on nodes in the fast and the default partitions.
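
The placement described in the two points above can be sketched as a small lookup. The node names and the CLUSTER mapping are assumptions made for illustration; the source does not name individual nodes, and an actual cluster assigns labels through YARN rather than a Python dictionary.

```python
# Hypothetical cluster layout: node name -> partition (node label).
# Node names are invented for illustration only.
CLUSTER = {
    "node1": "memory",
    "node2": "fast",
    "node3": "fast",
    "node4": "default",
}


def nodes_with_labels(cluster, labels):
    """Return the sorted names of nodes whose partition is in the given labels."""
    return sorted(name for name, label in cluster.items() if label in labels)


# 1. The Blaze engine starts on nodes in the fast partition.
engine_nodes = nodes_with_labels(CLUSTER, {"fast"})

# 2. Jobs in the Blaze_only queue can run on the fast and default partitions.
job_nodes = nodes_with_labels(CLUSTER, {"fast", "default"})
```

Under these assumed node names, engine_nodes contains node2 and node3, and job_nodes contains node2, node3, and node4. The node in the memory partition is used by neither.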
