Application and Database Design Guide

Application and Database Design Guide

Customer Identification Systems

Customer Identification Systems

This chapter provides a background to why Informatica’s approach to identity search and matching supports strong customer identification systems.

What Data to Use for Customer Look-up

Customer look-up is expected to be both quick and accurate.
In some systems, frequently the search will use an id-number, which is ideal for quick and accurate retrieval. In other systems identity numbers are just not available or the business does want to make its customer feel like a number or an account.
When an id-number is not available, the search will need to be driven by some other piece of identifying data.
One of the challenges for the system designer is to decide which attribute or attributes are the best to use for this identity search.
Given a choice of name, birth date, telephone numbers or an address, how does one determine the best?
In an ideal world, one would try combinations of each attribute over a period of time and measure the system’s results and the business benefits. In the real world, the decision often has to be made without empirical evidence.
Because dates suffer from the fact that a valid variation in any component creates a completely different but valid date, a search driven by a date is going to fail when one or more of the components are wrong.
Except where property addresses are the foundation of "customer" (for example, electricity and water companies), then addresses suffer from the fact that customers move regularly. A search driven by address is therefore going to fail when an address change has not been notified to the system.
Except when telephone numbers are the foundation of "customer" (example, telephone companies or utility and emergency services), then telephone numbers suffer from the fact that customers move and change them, use home, work, mobile and public numbers. A search driven by telephone numbers is therefore going to fail when the number has not previously been notified to the system. And like dates, errors in the number, or variations in format make indexing with such numbers quite unproductive.
Names avoid the pitfalls of dates, phone numbers and addresses. Unlike dates or telephone numbers, if a character in a name is different, then it still has a good chance of being identified because systems can compensate for variation and error in names. And unlike addresses and phone numbers, names tend to remain more stable over time.

Use of Full Name in the Customer Search

An important characteristic of a customer name search transaction is that the average customer actually wants to be identified and will provide a full name when requested.
In the majority of cases, that name will be given correctly and will match the data on file. If the search takes too long however, both the customer and the system resource manager will generally complain.
Assuming the tuning of the system and database is addressed, the response time of a name search is dependent upon the commonality of the name, the volume of data on file, the richness of the file name and the design of the keys.
If 1% of the customer data is about SMITH, a key built from family name alone in a database of 1,000,000 records could return 10,000 records for the SMITH search. A key built from family name + initial might reduce that volume to 500 records, but that is still too many. In addition, 1% of the customer searches will probably be about SMITH and so the problem gets worse.
If the customer take-on system only captured family name, or family name and initial, then these difficult to use results are the best one could expect.
Provided the customer take-on system captures the full name, and given that we are expecting the average customer to provide their full name for future access, the name search should be able to take advantage of this to search a much narrower set of records.
This requires the operator to understand that using the full name will improve the response time. Such a system must also allow the widening of the search in case the match could not be found at the initial full level of detail.

0 COMMENTS

We’d like to hear from you!