Table of Contents

Search

  1. Abstract for Profiling Sizing Guidelines
  2. Supported Versions
  3. Profiling and Discovery Sizing Guidelines

Profiling and Discovery Sizing Guidelines

Profiling and Discovery Sizing Guidelines

Hardware Guidelines for Key and Functional Discovery

Hardware Guidelines for Key and Functional Discovery

The Profiling Service Module processes a data source sample to infer the keys and functional dependencies. The bandwidth requirement for flat files and relational databases is less because the data size is usually small.
Both key and functional dependency discovery algorithms have large CPU resource and temporary disk space requirements. The algorithms use memory to cache between the intermediate results and temporary disk.
The factors that affect profile performance include CPU, memory, disk size, and disk speed:
Component
Requirements
CPU
Uses one CPU for each mapping.
Memory
Requires 256 MB of memory in addition to the mapping memory.
Disk Size
Caches intermediate profile results to the disk and the required amount of disk space depends on the complexity of data and the number of columns. You can consider a minimum of 128 GB disk space.
Disk Speed
The input/output speeds, for both memory and disk, affect the Profiling Service Module performance. Higher speeds allow the Profiling Service Module to quickly access large amounts of data.

0 COMMENTS

We’d like to hear from you!