Developer Transformation Guide

10.5
- 10.5.2
- 10.4.1
- 10.4.0

Back Next

Viewing Match Cluster Analysis Data

You can view statistical data on the clusters that the transformation can create. The cluster statistics summarize the level of record duplication in the data set based on the current mapping configuration.

To view the data, right-click the Match transformation in the mapping canvas and select

Match Cluster Analysis

Before you run the analysis, validate the mapping that contains the transformation.

Match cluster analysis displays data for the following properties:

Property	Description
Source	The number of input data rows.
Last run	The date and time of the analysis.
Total number of discovered clusters	The number of clusters that the match analysis generates when the mapping runs.
Minimum cluster size	The number of records in the cluster or clusters that contain the fewest records. If the minimum cluster size is 1, the data set contains at least one unique record.
Maximum cluster size	The number of records in the cluster or clusters that contain the most records. If this value greatly exceeds the average cluster size, the largest cluster might contain false duplicates.
Number of unique records	The number of records in the data set that do not match another record with a score that meets the match threshold.
Number of duplicate records	The number of records in the data set that match another record with a score that meets the match threshold.
Total comparisons	The number of comparison operations that the mapping performs.
Average cluster size	The average number of records in a cluster.

Rename Saved Search

Table of Contents

Developer Transformation Guide

Developer Transformation Guide

Viewing Match Cluster Analysis Data

Viewing Match Cluster Analysis Data