The Content Management Service is an application service that manages reference data. It provides reference data information to the Data Integration Service and to the Developer and Analyst tools. A master Content Management Service maintains probabilistic model and classifier model data files across the domain.
The Content Management Service manages the following types of reference data:
Address reference data
You use address reference data when you want to validate the postal accuracy of an address or fix errors in an address. Use the Address Validator transformation to perform address validation.
Identity populations
You use identity population data when you want to perform duplicate analysis on identity data. An identity is a set of values within a record that collectively identify a person or business. Use a Match transformation or Comparison transformation to perform identity duplicate analysis.
Probabilistic models and classifier models
You use probabilistic or classifier model data when you want to identify the type of information that a string contains. Both model types use probabilistic logic to identify or infer the type of information in the string. Use a probabilistic model in a Parser or Labeler transformation. Use a classifier model in a Classifier transformation, which is the appropriate choice when each input string contains a significant amount of data.
Reference tables
You use reference tables to verify the accuracy or structure of input data values in data quality transformations.
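As a rough illustration of what verifying input values against a reference table means, the following Python sketch checks each input against a small lookup of valid values. It is a conceptual sketch only and does not use any Informatica API: the table contents and the names REFERENCE_TABLE and lookup_valid_value are hypothetical.

```python
# Conceptual sketch only: this is NOT an Informatica API.
# REFERENCE_TABLE is a hypothetical reference table that maps each valid
# value to the input variants that should be accepted for it.
REFERENCE_TABLE = {
    "United States": {"US", "USA", "U.S.", "United States"},
    "United Kingdom": {"UK", "U.K.", "United Kingdom"},
}


def lookup_valid_value(value, table=REFERENCE_TABLE):
    """Return the valid reference value for an input, or None if no entry matches."""
    # Normalize whitespace and letter case so " usa " matches "USA".
    candidate = value.strip().casefold()
    for valid, variants in table.items():
        if candidate in {variant.casefold() for variant in variants}:
            return valid
    return None


if __name__ == "__main__":
    for raw in ["usa", " U.K. ", "Atlantis"]:
        result = lookup_valid_value(raw)
        status = "valid" if result is not None else "not in reference table"
        print(f"{raw!r}: {status}")
```

In this sketch, an input that matches no entry returns None, which corresponds to a value that the reference table cannot verify.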
The Content Management Service also compiles rule specifications into mapplets.
Use the Administrator tool to administer the Content Management Service. Recycle the Content Management Service to start it.