, you can build and run cleanse functions that cleanse data.
A
cleanse function
is a function that the
MDM Hub
applies to a data value in a record to standardize or verify it. For example, if the data has a column for salutation, you could use a cleanse function to standardize all instances of “Doctor” to “Dr.” You can apply cleanse functions successively, or simply assign the output value to a column in the staging table.
Types of Cleanse Functions
The
MDM Hub
can have one of the following types of cleanse functions:
Cleanse function defined by the
MDM Hub
Cleanse function defined by a cleanse engine
Custom cleanse function that you define
The pre-defined functions provide access to specialized cleansing functionality, such as name and address standardization, address decomposition, gender determination, and so on. See the console for more information on the Cleanse Function tool.
Libraries
Functions are organized into
libraries
—Java libraries and user libraries, which are folders used to organize the functions that you can use in the Cleanse Functions tool in the Model workbench.
Cleanse Functions are Secure Resources
You can configure cleanse functions as secure resources and make them SECURE or PRIVATE.
Available Functions Subject to Cleanse Engine
The functions you see in the
Hub Console
depend on the cleanse engine that you use. The
MDM Hub
shows the cleanse functions that the cleanse engine makes available. Regardless of which cleanse engine you use, the overall process of data cleansing in the