When you add a step to a labeler asset, Data Quality prompts you to define a step with a regular expression or with a dictionary.
You add a dictionary to a labeler asset in
Data Quality
, and you add the
asset to the Labeler transformation in a mapping in
Data Integration
.
The following image shows the options that you configure when you define a step with a dictionary:
The dictionary options include the following properties:
Labeler mode.
Indicates the type of labeling operations to perform on the input data.
Add Step option.
Adds a step to the asset. A step describes a labeling operation that a mapping can apply to an input data field.
Up and Down options.
Moves a step that you select up or down within the step sequence.
Step sequence.
Defines the order in which a mapping applies each step to the input field at run time. The mapping performs labeling operations in the order that you specify.
Options name.
Identifies the dictionary that you add to the step.
Test input field.
Contains the input data that the Secure Agent uses to test the steps in the labeler sequence.
Test output fields.
Contain the result of the test.
The test output fields contain the following data:
A copy of the input field data in which any dictionary value is replaced by the label that you specify.
A copy of the input field data in which the discrete input values use the output field delimiter that you specify. The field can also return the valid dictionary version of any input value that matches a dictionary value.
Import file option.
Imports data to the test panel.
Step type.
Identifies the type of step to which the properties apply.
Dictionary name.
Identifies the dictionary that
the step applies to the input data.
You can see a preview of the
Reference 360
dictionary columns, but can't edit the columns. To edit the dictionary data,
go to
Reference 360
.
Label name.
Specifies the label that the step applies to the values in an input string. You enter the label.
Casing option.
Specifies whether the step considers the character case of the input values that it compares to the dictionary values. For example, the character case may be relevant if you plan to label person names from an input string, as the person names may begin with an uppercase letter.
Clear the option if the character case is not relevant to the data that the step will examine.
Mode option.
Determines the labeling method. Choose Inclusive or Exclusive mode to label values that match or do not match the dictionary values. The default mode is Inclusive.
In Inclusive mode, the labeling operation assigns the label to any value that matches a value in the dictionary.
In Exclusive mode, the operation assigns a label to any value that does not match a value in the dictionary.
Standardization option.
Specifies whether the step
will replace any input value that matches a dictionary value with the
corresponding value from the valid column in the dictionary.
The step compares the input values to every value in the dictionary. If the input
value matches a valid value, the step does not update the value. If you clear
the option, the step does not standardize any input value that matches a
dictionary value.
Valid dictionary column name.
Identifies the valid
column in the dictionary that you select. The valid column contains the
preferred versions of the values in the dictionary. The valid dictionary column
is active when you select the standardization option.