Table of Contents

Search

  1. Preface
  2. Introduction
  3. Definition File Overview
  4. Customization Steps
  5. Service Group Definition
  6. Algorithm Definition
  7. Edit-list Definition
  8. Matching Scheme Definition

Service Group Definition and Customization Guide

Service Group Definition and Customization Guide

N3SCD – Date Matching

N3SCD – Date Matching

This method compares two dates. It standardizes the dates into the form DDMMCCYY then compares them and returns a Score relative to the number of matching parts. Each part which matches scores 33% and the maximum Score is 100. Transposed characters within day, month or year score 25% out of 33%, with an additional 5% being subtracted for each additional part which is transposed or unmatched. (There is a special case Score of 85% for a match of DDMMCCYY to MMDDCCYY). The weight modifier returned is 100.
This method can also be used to decide if two dates are within a range of days.
LOPT=(DDMMYY / MMDDYY / YYMMDD)
The order of the date components being passed. Select only one. The default is
DDMMYY
. Note that even though these options do not explicitly refer to the century component, inclusion of the century is fully supported. It is highly recommended that the century be included in the dates being matched. The following month abbreviations are also accepted in the dates:
JAN, FEB, MAR, APR, MAY, JUN, JUL, JLY, AUG, SEP, OCT, NOV, DEC
To specify a different set of abbreviations, the
N3SCD
method definition should be set up to point to an Algorithm which has a Formatting User-exit with the different abbreviations (i.e.
FORMATTING=N3FTxx
).
LOPT=(RANGE*[number])
If the dates are [number] days apart or less then Score 100 – if they are less than twice [number] days apart then Score 50 – otherwise Score 0. [number] can be in the range 0 to 32767.
LOPT=(EXTDRNGE+RANGE*[number])
This option allows a more gradual degrading of the Score returned from the day range checking specified by
RANGE
. If the difference between two dates is more than the range specified Score = 0 otherwise Score = 100 - (difference between two dates * 100 / range) For example:
EXTDRNGE+RANGE*10 20000616 20000610
: Score = 100 - (6 * 100/10) = 100 - 60 = 040. 20000616 20000615: Score = 100 - (1 * 100/10) = 100 - 10 = 090. This option has no effect if
RANGE
is not specified.
LOPT=(OKTRANS)
Used in conjunction with
RANGE*[number]
and
EXTDRNGE*[number]
, this option causes the Method to transpose the month and day values in order to determine the lowest possible range. The lowest value is then used to determine the Score to be returned.
LOPT=(WZERO)
If parts of one or both dates are missing (e.g. the century is missing), set the weight to zero. If this option is not set, the method will attempt to interpret the date, but the results may not be consistent. For example, is 190305 equal to May 1903, 5th March 1919, 5th March 2019 or some other date.
LOPT=(NONULL)
Default behavior is to treat 00 in the year position as a null value. This option causes the method to treat 00 as a valid year.
LOPT=(YEARX*[number])
This option can be used to assign a century if dates are missing the century portion. A year value greater than
[number]
will cause a century of 19 to be used. A year value less than or equal to
[number]
will cause a century of 20 to be used.
LOPT=(EXTDDATE)
This option controls the matching of transposed characters within day, month and/or year components. The following table shows the Scores returned with
EXTDDATE
based on the matching, transposed or non-matching components:
Day (DD)
Month (MM)
Year (YY)
Score
Matched
Matched
Matched
100
Matched
Matched
Transposed
91
Matched
Matched
Unmatched
66
Matched
Transposed
Matched
91
Matched
Transposed
Transposed
78
Matched
Transposed
Unmatched
53
Matched
Unmatched
Matched
66
Matched
Unmatched
Transposed
53
Matched
Unmatched
Unmatched
33
Transposed
Matched
Matched
91
Transposed
Matched
Transposed
78
Transposed
Matched
Unmatched
53
Transposed
Transposed
Matched
78
Transposed
Transposed
Transposed
65
Transposed
Transposed
Unmatched
40
Transposed
Unmatched
Matched
53
Transposed
Unmatched
Transposed
40
Transposed
Unmatched
Unmatched
15
Unmatched
Matched
Matched
66
Unmatched
Matched
Transposed
53
Unmatched
Matched
Unmatched
33
Unmatched
Transposed
Matched
53
Unmatched
Transposed
Transposed
40
Unmatched
Transposed
Unmatched
15
Unmatched
Unmatched
Matched
33
Unmatched
Unmatched
Transposed
15
Unmatched
Unmatched
Unmatched
0
Matched MM
Matched DD
Matched
85
LOPT=(CLEANEDT)
The option turns on cleaning and editing rules for Date Matching except the major marker rules. To use this option, the Method Definition must specify the algorithm that uses the Edit-list containing any special rules. The type of Edit-list rules must be character rule definitions.
For example, consider matching a date of birth field, and the value "010101" is a null value that you do not want to consider. Use the following the Edit-list rule definition:
*S >010101< *W ><
In the matching definition, use the following method definition:
DEFINE METHOD=DATE,EP=N3SCD,ALGNAME=<AlgName>
Use the following scheme:
LOPT=(CLEANEDT)
Use the following Local Options to specify reasonably accurate Date Matching:
LOPT=(YYMMDD+RANGE*1+WZERO)
Use the following Local Options to specify reasonably loose Date Matching:
LOPT=(YYMMDD+RANGE*31+OKTRANS+EXTDDATE)

0 COMMENTS

We’d like to hear from you!