The Match Output view contains properties that specify the cache memory behavior and the match score threshold. You can also use the properties to determine how the transformation selects data store records for analysis and writes data store records as output.
After you select a match output type, configure the following properties:
Cache Directory
Specifies the directory to which the Data Integration Service writes temporary data during identity match analysis. The Data Integration Service writes temporary files to the directory when the volume of data that the match analysis generates is greater than the available system memory. The Data Integration Service deletes the temporary files after the mapping runs.
You can enter a directory path on the property, or you can use a system parameter to identify the directory. Specify a local path on the Data Integration Service machine. The Data Integration Service must be able to write to the directory. The default value is the CacheDir system parameter.
Cache Size
Determines the amount of system memory that the Data Integration Service assigns to identity match analysis. The default value is 400,000 bytes.
If the match analysis generates a greater amount of data, the Data Integration Service writes the excess data to the cache directory. If the match analysis requires more memory than the system memory and the file storage can provide, the mapping fails.
If you enter a value of 65536 or higher, the transformation reads the value in bytes. If you enter a lower value, the transformation reads the value in megabytes.
Match
Identifies the records to analyze when the transformation reads index data from database tables. Use the options on the
Match Type
view to identify the index tables.
By default, the transformation analyzes all the records in the data source and the index database tables. Configure the Match property to specify a subset of the records for duplicate analysis.
Output
Filters the records that the transformation writes as output when you configure the transformation to read index database tables. Use the options on the
Match Type
view to identify the index tables.
By default, the Match transformation writes all records from the data source and the index database tables as output. Configure the
Output
property when you do not need to review all the records in the input data.
Threshold
Sets the minimum match score that identifies two records as potential duplicates of each other.
You can assign a parameter to the threshold value. Set a decimal value in the range 0 through 1.