You can define different types of duplicate analysis in the Match transformation. The duplicate analysis operations that you define depend on the number of data sources in the mapping and the type of information that the sources contain.
Consider the following factors when you configure a Match transformation:
You can select a single column from a data set or you can select multiple columns.
You can analyze columns from a single data source or you can analyze two data sources.
You can configure the Match transformation to analyze the raw data in the input port fields, or you can configure the transformation to analyze the identity information in the data.
You can configure the Match transformation to write different types of output. The type of output that you select determines the number of records that the transformation writes and the order of the records.
To increase performance, sort the input records into groups before you perform the match analysis.