User Guide

Enable Scheduling and Node Labeling

To enable scheduling and node labeling in the Hadoop environment, update the yarn-site.xml properties in the cluster configuration.
Configure the following properties:
yarn.resourcemanager.scheduler.class
Defines the YARN scheduler that the Data Integration Service uses to assign resources on the cluster.
<property>
  <name>yarn.resourcemanager.scheduler.class</name>
  <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.[Scheduler Type].[Scheduler Type]Scheduler</value>
</property>
For example:
<property>
  <name>yarn.resourcemanager.scheduler.class</name>
  <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler</value>
</property>
yarn.node-labels.enabled
Enables node labeling.
<property>
  <name>yarn.node-labels.enabled</name>
  <value>TRUE</value>
</property>
yarn.node-labels.fs-store.root-dir
The HDFS directory where node labels are stored so that they can be updated dynamically.
<property>
  <name>yarn.node-labels.fs-store.root-dir</name>
  <value>hdfs://[Node name]:[Port]/[Path to store]/[Node labels]/</value>
</property>
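Taken together, the three properties might appear in yarn-site.xml roughly as in the sketch below. The host name `namenode.example.com`, the port `8020`, and the path `/yarn/node-labels/` are hypothetical placeholders chosen for illustration and do not come from this guide. Substitute the values for your own cluster. The enclosing `<configuration>` element is the usual root element of a Hadoop site file.

```xml
<configuration>
  <!-- Scheduler used to assign cluster resources; Capacity Scheduler as in the example above -->
  <property>
    <name>yarn.resourcemanager.scheduler.class</name>
    <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler</value>
  </property>

  <!-- Turns on node labeling -->
  <property>
    <name>yarn.node-labels.enabled</name>
    <value>TRUE</value>
  </property>

  <!-- HDFS directory for the node label store.
       Host, port, and path are hypothetical; replace them with your own values. -->
  <property>
    <name>yarn.node-labels.fs-store.root-dir</name>
    <value>hdfs://namenode.example.com:8020/yarn/node-labels/</value>
  </property>
</configuration>
```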
