Deduplicate assets

Consolidation tab options

Use the Consolidation tab options to configure the type of consolidation that a mapping will perform.
The following image shows the Consolidation tab options:
The image shows two overlapping views of the Consolidation tab. One view shows the options for row-based consolidation. The other view shows the options for field-based consolidation.
The Consolidation tab includes the following options:
  1. Consolidation mode.
    Identifies the type of consolidation that the Deduplicate transformation will perform when the mapping runs. The type that you select determines how the transformation selects the preferred record in each set of duplicate records.
    Choose the row-based option to select a preferred record based on the quantity of data in the identity fields. Choose the field-based option to build a preferred record from the data values across one or more records. You can also choose not to consolidate the duplicate record sets.
  2. Row strategy.
    Determines how the transformation will select the preferred record when you choose the row-based consolidation mode.
    Choose Most Data to specify the record with the greatest number of characters as the preferred record. Choose Most Filled to specify the record with the highest number of populated fields. Choose Modal Exact to select the record with the highest number of fields that contain the most common values in their respective columns.
  3. Field name column.
    Lists the fields in the input records that the Deduplicate transformation will read. The field name column is visible when you select the field-based consolidation mode. You can specify a consolidation strategy for each field when you select field-based consolidation.
  4. Strategy.
    Determines how the transformation selects the value in each field for the preferred record when you choose the field-based consolidation mode.
    You can select one of the following strategies:
    • Highest row ID. Use the value from the record with the highest row ID or sequence ID. Highest row ID is the default strategy.
    • Average. Use the average value across the records.
    • Longest. Use the longest value in the field across the records.
    • Maximum. Use the highest number in the field across the records. Or, choose the last value in alphabetical order.
    • Minimum. Use the lowest number in the field across the records. Or, choose the first value in alphabetical order.
    • Most frequent. Use the most frequently occurring value in the field across the records, including blank, empty, or zero-length string fields.
      The consolidation operation will not add a null value to the preferred record.
    • Most frequent non-blank. Use the most frequently occurring value in the field across the records, excluding null, blank, empty, or zero-length string fields.
    • Shortest. Use the shortest value in the field across the records.
  5. Type.
    Indicates whether the asset created the field during the deduplication operation or whether you added the field to the asset in the Consolidation pane.
  6. Data Type.
    Identifies the data type of the field. The default data type on all fields is String. You can modify the data type in field-based consolidation to suit your data requirements.
    You can select one of the following data types for a field:
    • Date/Time
    • Float
    • Integer
    • String
    If you modify the data type of a field, do not change the mode to No Consolidation or Row-based consolidation without first saving the asset. The asset discards any update that you make to a data type in Field-based consolidation mode if you change to another mode.
  7. Add field button.
    Adds one or more inputs to the consolidation operation when you select the field-based mode.
    Add fields in the following cases:
    • The identity analysis that you define on the Deduplication tab does not cover all of the fields that the transformation will analyze.
    • You want to specify a non-default strategy for the additional fields.
    The field-based strategies will apply to every input field that you map to the deduplicate asset in the transformation. If you do not specify a strategy for a field in field-based mode, the transformation applies the default strategy.
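
The row strategies and field strategies described above can be illustrated with a short Python sketch. This is not the product's implementation, only a minimal model of the selection rules as this page states them. The sample records, the function names, the treatment of blank values in Modal Exact, and the tie-breaking behavior (the first matching record wins) are all assumptions made for illustration. The Average strategy is omitted because the sample fields are not numeric.

```python
from collections import Counter

# Hypothetical sample: one set of duplicate records with illustrative fields.
records = [
    {"row_id": 1, "name": "Jon Smith", "city": "Boston", "phone": ""},
    {"row_id": 2, "name": "Jonathan Smith", "city": "Boston", "phone": "555-0100"},
    {"row_id": 3, "name": "Jon Smith", "city": "", "phone": ""},
]
FIELDS = ["name", "city", "phone"]


# --- Row-based consolidation: pick one whole record as the preferred record ---

def most_data(rows):
    """Most Data: the record with the greatest number of characters."""
    return max(rows, key=lambda r: sum(len(r[f]) for f in FIELDS))


def most_filled(rows):
    """Most Filled: the record with the highest number of populated fields."""
    return max(rows, key=lambda r: sum(1 for f in FIELDS if r[f]))


def modal_exact(rows):
    """Modal Exact: the record with the most fields that hold the most
    common value in their column. Counting blanks as values is an assumption."""
    modes = {f: Counter(r[f] for r in rows).most_common(1)[0][0] for f in FIELDS}
    return max(rows, key=lambda r: sum(1 for f in FIELDS if r[f] == modes[f]))


# --- Field-based consolidation: build the preferred record field by field ---

def consolidate_field(rows, field, strategy="highest_row_id"):
    """Return the value that the named strategy selects for one field."""
    values = [r[field] for r in rows]
    if strategy == "highest_row_id":  # the default strategy
        return max(rows, key=lambda r: r["row_id"])[field]
    if strategy == "longest":
        return max(values, key=len)
    if strategy == "shortest":
        return min(values, key=len)
    if strategy == "maximum":  # last value in alphabetical order for strings
        return max(values)
    if strategy == "minimum":  # first value in alphabetical order for strings
        return min(values)
    if strategy == "most_frequent":  # blank values are counted
        return Counter(values).most_common(1)[0][0]
    if strategy == "most_frequent_non_blank":  # blank values are ignored
        non_blank = [v for v in values if v]
        return Counter(non_blank).most_common(1)[0][0] if non_blank else ""
    raise ValueError(f"Unknown strategy: {strategy}")


# Fields without an explicit strategy fall back to the default strategy.
strategies = {"name": "longest", "city": "most_frequent_non_blank"}
preferred = {
    f: consolidate_field(records, f, strategies.get(f, "highest_row_id"))
    for f in FIELDS
}

print(most_data(records)["row_id"])    # 2
print(most_filled(records)["row_id"])  # 2
print(modal_exact(records)["row_id"])  # 1
print(preferred)  # {'name': 'Jonathan Smith', 'city': 'Boston', 'phone': ''}
```

In this sample, Most Data and Most Filled both select record 2, while Modal Exact selects record 1 because all three of its values match the most common value in each column. The field-based result combines values from several records, and the phone field is blank because the default strategy reads it from the record with the highest row ID.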
