The following process flow summarizes the steps that you take to configure a Match transformation for field match analysis. You can define a process that uses the Match transformation alone or that uses the Match transformation and other transformations.
When you add a Match transformation to a mapping in field match analysis, add an upstream Key Generator transformation to the mapping.
To prepare the data for the Match transformation, perform the following steps:
Organize the source data records into groups.
Use a Key Generator transformation to assign a group key value to each record. The group assignments reduce the number of computations that the Match transformation must perform.
Verify that the data source records contain unique sequence identifier values. You can use a Key Generator transformation to create the values.
Perform the following steps in the Match transformation:
Specify field analysis as the match type, and specify the number of data sources.
If you configure the transformation to analyze two data sets, select a master data set.
Use the
Match Type
view to set the type and the number of data sources.
Define a match analysis strategy. Select an algorithm, and assign a pair of columns to the algorithm.
Use the
Strategies
view to define the strategy.
Specify the method that the transformation uses to generate the match analysis results.
Set the match threshold value. The match threshold is the minimum score that can identify two records as duplicates of one another.
Use the
Match Output
view to select the output method and the match threshold.
You can set the match threshold in a Match transformation or a Weighted Average transformation. Use the Weighted Average transformation if you create a match mapplet.