Connect the Match transformation output ports to the data target in the mapping. Select the ports that contain the record data that you want to write to the data target.
The transformation includes a series of preset ports for clustered data. Select the preset ports that indicate duplicate status of the records and identify the data source that stores each record.
The following ports contain data that you can use to find duplicate records and determine the source of the records:
The ClusterSize port indicates the number of records in a cluster. If a record belongs to a cluster with a cluster size greater than 1, the transformation considers the record to be a duplicate of another record.
The ClusterID port identifies the cluster that a record belongs to. Use the ClusterID data to find the records that are duplicates of the current record.
The PersistenceStatus port uses a code value to describe the relationship between the index data from the mapping source and the index data in the data store.
The PersistenceStatusDesc port returns a text description of the code values on the PersistenceStatus port.
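As an illustration of how the ClusterSize and ClusterID values can be read together after the data reaches the target, the following Python sketch groups duplicate records by cluster. It is not part of the transformation itself. The sample rows and the RecordID column name are hypothetical, and the sketch assumes that the target data has been exported as a list of rows.

```python
from collections import defaultdict

# Hypothetical rows read back from the data target.
# ClusterID and ClusterID-related values mirror the Match transformation
# ports; RecordID is an assumed name for a column that identifies each record.
rows = [
    {"RecordID": "A1", "ClusterID": 1, "ClusterSize": 2},
    {"RecordID": "A2", "ClusterID": 1, "ClusterSize": 2},
    {"RecordID": "A3", "ClusterID": 2, "ClusterSize": 1},
    {"RecordID": "A4", "ClusterID": 3, "ClusterSize": 3},
    {"RecordID": "A5", "ClusterID": 3, "ClusterSize": 3},
    {"RecordID": "A6", "ClusterID": 3, "ClusterSize": 3},
]


def group_duplicates(rows):
    """Return a dict that maps each ClusterID to the records in that cluster.

    Only clusters with a ClusterSize greater than 1 are kept, because those
    are the clusters whose records are considered duplicates of one another.
    """
    clusters = defaultdict(list)
    for row in rows:
        if row["ClusterSize"] > 1:
            clusters[row["ClusterID"]].append(row["RecordID"])
    return dict(clusters)


duplicates = group_duplicates(rows)
```

In this sketch, record A3 is left out of the result because it is the only record in its cluster. To find the duplicates of a given record, look up its ClusterID value in the returned dictionary.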
You can use other ports to review the relationships between the cluster records. The link port values and driver port values indicate the extent of the similarity between the records in each cluster.
In the current example, you connect all the ports to the data target. To view the output data on the ports, run the Data Viewer.