Errors made in spelling the spoken word.
Transcription and keying errors for written names and codes.
Missing words, initials, numbers or codes.
Mixed usage of first names and initials.
Mixed usage of 1, 2, etc. with. .. one, two . . . 1st, 2nd, .. First, Second, etc.
Nicknames, formal and informal abbreviations, synonyms, language variation of common words.
Concatenation or splitting of words and codes.
Extra words and word sequence variations.
Presence of irrelevant "noise words" in the data.
Missing or "null" data.
Presence of "foreign" names and addresses.
Failures to find all parts of compound or account names where multiple entities are present in one name field.
Anglicization (Localization) of names causing variation between formal name, as on a Passport or Driver’s License, and less formal names on other transactions.
The problems created by the frequent use of certain common last and first names, or use of common words and numbers in organization names and addresses.
The fact that many names can be made from title or "noise" words. For example, Sister J Bishop, The Limited, The Company Inc.