Clustering is the process of grouping similar or related records together. The logical rules, which define how to cluster records together, are specified using definition files and are stored in a special database called Rulebase.
For example, data records could be processed into the following groups:
all records about the same person
all records about the same family
all records about the same household
all records which have the same demographic attributes
Data may be clustered in many ways. Simple forms of clustering include records which:
have the same account number
have an identical name
have the same match-code
Complex forms of clustering may include records which: