Clustering Process

Clustering is the process of grouping similar or related records together. The logical rules that define how records are clustered are specified in definition files and stored in a special database called the Rulebase.
For example, data records could be processed into the following groups:
  • all records about the same person
  • all records about the same family
  • all records about the same household
  • all records which have the same demographic attributes
Data may be clustered in many ways. Simple forms of clustering group together records which:
  • have the same account number
  • have an identical name
  • have the same match-code
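The simple forms above all reduce to grouping on an exact key value. The following is a minimal Python sketch of that idea, not the product's own mechanism: the `cluster_by_key` function, the field names, and the sample records are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def cluster_by_key(records, key):
    """Group records whose `key` field is exactly equal (hypothetical helper)."""
    clusters = defaultdict(list)
    for record in records:
        clusters[record[key]].append(record)
    return dict(clusters)


# Illustrative sample data; field names are assumptions, not a real schema.
records = [
    {"id": 1, "name": "Ann Lee", "account": "A-100"},
    {"id": 2, "name": "Ann Lee", "account": "A-200"},
    {"id": 3, "name": "Bob Ray", "account": "A-100"},
]

by_account = cluster_by_key(records, "account")  # same account number
by_name = cluster_by_key(records, "name")        # identical name
```

The same records fall into different clusters depending on the key chosen, which is why the clustering rule has to be defined explicitly.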
Complex forms of clustering may group together records which:
  • could be about the same house
  • could be safely accepted as the same person
  • could possibly refer to the same person
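The difference between "safely accepted" and "could possibly refer to" can be pictured as two similarity thresholds. The sketch below is only an illustration under stated assumptions: it uses Python's standard `difflib.SequenceMatcher` as a stand-in similarity measure, and the threshold values, function names, and sample names are hypothetical rather than anything defined by the Rulebase.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical thresholds, chosen only for illustration.
ACCEPT = 0.90    # at or above: safely accepted as the same person
POSSIBLE = 0.75  # at or above: could possibly refer to the same person


def similarity(a, b):
    """Case-insensitive string similarity in the range 0.0 to 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def classify(a, b):
    """Label a pair of names as 'accept', 'possible', or 'reject'."""
    score = similarity(a, b)
    if score >= ACCEPT:
        return "accept"
    if score >= POSSIBLE:
        return "possible"
    return "reject"


def cluster_names(names):
    """Merge names linked by 'accept' pairs; return sorted lists of indices."""
    parent = list(range(len(names)))

    def find(i):
        # Follow parent links to the cluster root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(names)), 2):
        if classify(names[i], names[j]) == "accept":
            parent[find(j)] = find(i)

    groups = {}
    for i in range(len(names)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


clusters = cluster_names(["john smith", "jon smith", "mary jones"])
```

Pairs labelled `possible` are deliberately left unmerged here; in practice such pairs would typically be routed to a looser cluster or to manual review.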
