User Guide

10.0 HotFix 1
- 10.5 HotFix 3
- 10.5 HotFix 2
- 10.5 HotFix 1
- 10.5
- 10.2 HotFix 1
- 10.2
- 10.1
- 10.0

Back Next

Performance Optimization

The Clustering Process reads records from the database or input file. For each record, it generates a search range using the

KEY-FIELD

. The Name Index is searched to create a list of candidate records with similar keys. The set of candidates are then read from the database and scored against the search record.

The process can be optimized by

reducing the size of the candidate set, thereby reducing the amount of scoring required, and/or

reducing the cost of scoring two records

utilizing multiple CPUs

reducing database I/O

The following sections discuss ways in which to achieve these goals.

Design

Partitions

Key Data / Key-Score-Logic

Pre-Scoring

Scoring Phases

Utilizing multiple CPUs

Reducing Database I/O

Download Guide

Watch

Comments

Communities

Knowledge Base

Success Portal

0 COMMENTS

We’d like to hear from you! Log in to comment.

Rename Saved Search

Table of Contents

User Guide

User Guide

Performance Optimization

Performance Optimization