Data Engineering Integration
- Data Engineering Integration H2L
- All Products
Mp= Number of Parallel Map Task Containers = min of (4 x number of disks, number of logical cores) Rp= Number of Parallel Reduce Task Containers = min of (65% of number of logical cores, 3 x number of Disks)
yarn.nodemanager.resource.memory-mb
| The amount of physical memory in MB that can be allocated for containers. Informatica recommends reserving some memory for other processes running in a node.
|
yarn.nodemanager.resource.cpu-vcores
| The number of CPU cores that can be allocated for containers. Informatica recommends setting the value to the number of logical cores available in the node.
|
yarn.nodemanager.vmem-check-enabled
| The virtual memory check is set to false by default. Retain the default value.
|
mapreduce.map.memory.mb
| Memory allocated to map task containers.
|
mapreduce.reduce.memory.mb
| Memory allocated to reduce task containers.
|
Number of logical cores = 24 Number of disks = 7 Amount of physical memory available for containers = 64 GB Mp= min (4 X 7, 24) = min (28, 24) = 24 Rp= min (0.65 X 24, 3 X 7) = min (15.6, 21) = 15.6 ≈ 16 yarn.nodemanager.resource.memory-mb = 65536 mapreduce.map.memory.mb = yarn.nodemanager.resource.memory-mb / Mp= 65536 / 24 = 2730 mapreduce.reduce.memory.mb = yarn.nodemanager.resource.memory-mb / Rp= 65536 / 16 = 4096