Table of Contents

Search

  1. Abstract for Profiling Sizing Guidelines
  2. Supported Versions
  3. Profiling and Discovery Sizing Guidelines

Profiling and Discovery Sizing Guidelines

Profiling and Discovery Sizing Guidelines

Hardware Guidelines for Column Profiles

Hardware Guidelines for Column Profiles

The factors that affect profile performance include the speed of the CPU, memory size, disk space, and the speed of the disk and network.
Consider the following hardware considerations for column profiles:
Component
Requirement
CPU
The Profiling Service Module uses the multithreaded environment of the Data Integration Service. Therefore, the CPU speed is less important than the number of cores in the CPU. To calculate the number of cycles that the Profiling Service Module uses each second, add the clock speeds of the cores.
Memory
Profile operations run faster with more memory. When you run a profile on a flat file source, the Profiling Service Module uses memory to sort the value frequency data and buffer data. The Profiling Service Module performs multiple read operations to the same part of a file by reading from the memory buffer and not from the flat file on disk. This method applies to rule profiling for flat file and relational sources.
Disk
The Profiling Service Module uses disk space for temporary storage when memory cannot store all the intermediate profile results. The Profiling Service Module uses multiple temporary directories in a single profile job. The Profiling Service Module divides the storage and Input/Output operations among multiple disks in parallel.
The profile performance increases if separate physical disks have the temporary directories. Disk technologies, such as rotational speed and on-disk buffering, affect profile performance.
Input/Output
The Input/Output speeds for memory, disk, and network affect the performance of the Profiling Service Module. Higher speeds allow the Profiling Service Module to quickly access large amounts of data. Network speed affects the relational databases that are not on the Data Integration Service machine and flat files that are on a storage device attached to the network.

0 COMMENTS

We’d like to hear from you!