You can configure the following masking rules for substitution masking:
Repeatable Output
. Returns deterministic results between sessions. The Data Masking transformation stores masked values in the storage table.
Seed Value
. Apply a seed value to generate deterministic masked data for a column. Enter a number between 1 and 1,000.
Unique Output
. Force the Data Masking transformation to create unique output values for unique input values. No two input values are masked to the same output value. The dictionary must have enough unique rows to enable unique output.
When you disable unique output, the Data Masking transformation might not mask input values to unique output values. The dictionary might contain fewer rows.
Unique Port
. The port used to identify unique records for substitution masking. For example, you want to mask first names in a table called Customer. If you select the table column that contains the first names as the unique port, the data masking transformation replaces duplicate first names with the same masked value. If you select the Customer_ID column as the unique port, the data masking transformation replaces each first name with a unique value.
Optimize Dictionary Usage
.
Applicable if you select the
Repeatable Output
option. Increases the usage of masked values from a dictionary.
Dictionary Information
. Configure the reference table that contains the substitute data values. Click
Select Source
to select a reference table.
Dictionary Name
. Displays the name of the reference table that you select.
Dictionary Connection
. Displays the name of the connection that contains the dictionary.
Serial Number Column
. Choose the column to return to the Data Masking transformation.
Sort Column
. The dictionary column on which you want to sort entries. Specify a sort column to generate deterministic results even if the order of entries in the dictionary changes. For example, if you move a relational dictionary and the order of entries changes, sort on the serial number column to consistently mask the data.
The column that you choose must contain unique values. You cannot use a column that might contain duplicate values to sort the data.
Output Column
. Choose the column to return to the Data Masking transformation.
Lookup condition.
Configure a lookup condition to further qualify what dictionary row to use for substitution masking. The lookup condition is similar to the WHERE clause in an SQL query. When you configure a lookup condition you compare the value of a column in the source with a column in the dictionary.
For example, you want to mask the first name. The source data and the dictionary have a first name column and a gender column. You can add a condition that each female first name is replaced with a female name from the dictionary. The lookup condition compares gender in the source to gender in the dictionary.