Hi, I'm Ask INFA!
What would you like to know?
ASK INFAPreview
Please to access Ask INFA.

Deduplicate assets

Deduplicate assets

Configuring the consolidation process

Configuring the consolidation process

To define a consolidation process, configure the options on the
Consolidation
tab.
  1. Select the
    Consolidation
    tab on the asset.
  2. Select a consolidation mode. The mode defines the method that the Deduplicate transformation uses to consolidate sets of duplicate records into single, preferred records.
    You can select one of the following modes:
    • No consolidation. The transformation does not create preferred records from the sets of matching records that the duplicate analysis defines.
    • Row-based consolidation. The transformation selects a preferred record from each set of matching records based on a strategy that you select.
    • Field-based consolidation. The transformation creates a preferred record based on a strategy that you specify for each field in the records. Field-based consolidation can combine values from more than one record to create a preferred record that is not present in the source data.
  3. Configure the row-based or field-based strategies for the fields in the input records.
  4. If you selected the field-based consolidation mode, optionally add one or more fields to the consolidation model.
    The following field names are reserved on the deduplicate asset:
    • ClusterId
    • ClusterSize
    • DriverId
    • DriverScore
    • GroupKey
    • LinkId
    • LinkScore
    • RowId
    • SequenceId
    Do not create a field with a reserved name.
  5. Save the transformation.
For more information about the consolidation options, see Consolidation tab options.

0 COMMENTS

We’d like to hear from you!