Data Quality accelerator bundles

Data Quality accelerator bundles

Data Quality bundle for Australia and New Zealand

Data Quality bundle for Australia and New Zealand

Use the Data Quality bundle for Australia and New Zealand to accelerate the configuration and deployment of data quality solutions in Australia and New Zealand organizations. The bundle includes mapplets and other assets that define data standardization, deduplication, address verification, and parsing operations for Australia and New Zealand data.
The following table lists the assets contained in the Data Quality bundle for Australia and New Zealand:
Name
Asset Type
Description
abn_acn_exception_values
DICTIONARY
Contains a list of values that cannot be used as a valid Australian Business Number (ABN) or Australian Company Number (ACN).
aus_bsb_codes
DICTIONARY
Contains a list of Australian bank branch codes.
aus_company_name_std_infa
DICTIONARY
Contains a sample of company names and alternatives.
aus_gender_infa
DICTIONARY
Contains a list of given names with associated gender. Includes "M" for male names, "F" for female names, and "U" if the gender is unknown.
aus_nicknames_infa
DICTIONARY
Contains a list of names and alternatives.
aus_prename_gender_infa
DICTIONARY
Contains a list of courtesy titles and the gender associated with the titles.
aus_telephone_prefix
DICTIONARY
Contains a list of Australian telephone prefix numbers from Australian Government Federal Register of Legislation.
c_aus_areacode_region
CLEANSE
Checks the input data against a list of Australian telephone prefix numbers from Australian Government Federal Register of Legislation and returns
VALID
if the data values match.
c_aus_get_company_name_std
CLEANSE
Standardizes company name for Australia.
c_aus_remove_intl_dialling_prefix
CLEANSE
Removes variations of the Australian international dialing code from the start of the input string. For example:
+61, 0061, 61
c_av6_assign_dq_status_code_description
CLEANSE
Returns the text description of a Match Code value that a verifier asset generates.
c_get_name_standard
CLEANSE
Replaces Australian nick names with the defined valid column.
c_merge_fields
CLEANSE
Merges the Overflow, Before, After and Unparsed fields from the parse asset.
c_merge_two_fields
CLEANSE
Merges two fields.
c_multi_uppercase
CLEANSE
Converts character case to uppercase for up to six input fields.
c_nzl_areacode_region
CLEANSE
Returns the name of the area code from the area code dialing prefix.
c_nzl_phone_line
CLEANSE
Removes the area codes from a New Zealand phone number.
c_nzl_remove_intl_dialling_prefix
CLEANSE
Removes the New Zealand international dialing prefixes.
c_nzl_standardize
CLEANSE
Replaces multiple zeros at the start of a string with a single zero.
c_remove_labelled_character
CLEANSE
Replaces an
X
,
_
and
S
with
@
.
c_Remove_Non_Digit_Chars
CLEANSE
Removes
@
and
+
symbols.
c_remove_non_numbers
CLEANSE
Removes non-numbers from a string.
c_Remove_Punctuation
CLEANSE
Removes all occurrences of punctuation from the input field.
c_Remove_Space
CLEANSE
Removes all occurrences of a character space from the input field.
c_remove_symbols_chars
CLEANSE
Removes symbols and letters.
c_remove_x
CLEANSE
Removes
X
or
x
from the start of an input.
c_Replace_Limited_Punctuation_w_Space
CLEANSE
Replaces any forward slash, backslash, exclamation point, period, or underscore character with a character space. Also replaces instances of multiple character spaces with a single space.
c_standardise_area_code
CLEANSE
Replaces the area code with the region.
c_Uppercase
CLEANSE
Converts the input text to uppercase.
dedupe_aus_familyname_and_address_match
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in family name and address data. The operation uses Address Part1 as the index key.
dedupe_aus_individual_name_and_address
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in person name and address data.
dedupe_aus_individual_name_and_date
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in person name and date data.
dedupe_aus_individual_name_and_email
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in person name and email address data. The operation uses Individual as the objective type.
dedupe_aus_individual_name_date
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in person name and date data.
dedupe_aus_individual_name_match
DEDUPLICATE
Identifies duplicate records in Australian data based on similarities in person name data.
dedupe_aus_individual_name_phone
DEDUPLICATE
Identifies duplicate records in Australian data based on person name and telephone number data.
dedupe_aus_individual_name_tfn_match
DEDUPLICATE
Identifies duplicate records in Australian data based on person name and Tax File Numbers (TFN).
dedupe_aus_personal_name_data
DEDUPLICATE
Identifies duplicate records in Australian data based on person name and personal data.
dq_av6_match_code_descriptions_infa
DICTIONARY
Contains alphanumeric status values that indicate the outcome of the verification operation for an address. Each status value has a corresponding text description.
lbl_abn_acn_exception_values
LABELER
Checks if the input value is in the list of Australian Business number exception values and marks the value as invalid if present and valid if not present.
lbl_all_characters
LABELER
Labels English alphabets, character spaces, and symbols.
lbl_aus_bank_code
LABELER
Labels Australian bank codes.
lbl_australia_bank_acct_number
LABELER
Labels Australian bank account numbers.
lbl_Non_Digit_Chars
LABELER
Labels all non-digit characters consistently.
lbl_nzl_bank_acct_number
LABELER
Labels New Zealand bank account numbers.
lbl_nzl_phone
LABELER
Labels New Zealand phone numbers.
lbl_nzl_social_welfare_no
LABELER
Labels a New Zealand Social Welfare Number using a regular expression. No checksum validation as information on checksum is not available.
lbl_word_symbol
LABELER
Labeler identifying Unicode definition of a word combined with one or more symbols. For example: email addresses
mplt_aus_address_validation_hybrid
DMAPPLET
Validates the deliverability of Australian addresses. Use the mapplet when you can connect the input address fields to fields on the Multiline model.
mplt_aus_bank_account_number_parse
DMAPPLET
Parses a 7-digit to 13-digit string as an Australian bank account number from a longer string.
mplt_aus_bank_account_number_standardize
DMAPPLET
Standardizes a 7-digit to 13-digit string as an Australian bank account number, regardless of spacing or punctuation.
If the input value is not between 7 and 13 digits in length, the mapplet returns the input value.
The mapplet returns the numbers without punctuation.
mplt_aus_bank_account_number_validate
DMAPPLET
Validates an Australian bank account number between 7-digit to 13-digit. Checks the format and length of the Australian bank account number.
The mapplet parses strings of numbers in the following formats:
  • nnnnnnn
  • r nnnnnnnnn
  • nnnnnnnnnnnnn
mplt_aus_bsb_parse
DMAPPLET
Parses a 6-digit string as an Australian BSB (Bank-State-Branch) number from a longer string.
The mapplet parses strings of numbers in the following format:
  • 111111
mplt_aus_bsb_standardize
DMAPPLET
Standardizes Australian BSB (Bank-State-Branch) number as indicated by the variable value in the expression.
The mapplet returns strings of numbers in the following format:
  • nnn-nnn
If any alphanumeric character or an input value is not equal to 6 digits, the mapplet returns the input value.
mplt_aus_bsb_validate
DMAPPLET
Validates a 6-digit string as an Australian BSB (Bank-State-Branch) number after checking the length and the format of the number.
The mapplet validates strings of numbers in the following format:
  • 111-111
mplt_aus_business_number_parse
DMAPPLET
Parses 11-digit numeric strings as Australian Business Numbers (ABN) from a string of text.
mplt_aus_business_number_standardize
DMAPPLET
Standardizes a 11-digit Australian Business Numbers (ABN) to
NN NNN NNN NNN
, regardless of spacing or punctuation.
If any alphanumeric character or an input value is less than 11 digit, the mapplet returns the input value.
mplt_aus_business_number_validate
DMAPPLET
Validates Australian Business numbers (ABN).
Rule assumes that the ABN is formatted by mplt_AUS_Business_Number_Standardize or is an 11 digit numeric string.
mplt_aus_company_name_standardize
DMAPPLET
Standardizes a company name and provides the acronym if possible.
mplt_aus_company_number_parse
DMAPPLET
Parses an Australian Company Number (ACN), which is a unique 9-digit identifier assigned by the Australian Securities and Investments Commission (ASIC).
mplt_aus_company_number_standardize
DMAPPLET
Standardizes the format of an input Australian Company Number (ACN), which is a unique 9-digit identifier assigned by the Australian Securities and Investments Commission (ASIC).
mplt_aus_company_number_validate
DMAPPLET
Validates an Australian Company Number (ACN), which is 9-digit number when registered. The Australian Company Number (ACN) must be displayed on all company documents.
mplt_aus_covid_document_number_parse
DMAPPLET
Parses a number that fits the COVID document number format.
mplt_aus_covid_document_number_standardize
DMAPPLET
Standardizes the input to the COVID document number format, the default being
nnnn-nnnn-nnnn
. The option to standardize with a space or no punctuation is available by updating the
Covid_number_Format
variable in the expression asset.
mplt_aus_covid_document_number_validate
DMAPPLET
Validates that the input represents a COVID document number.
mplt_aus_crn_parse
DMAPPLET
Parses an Australian Centrelink Reference Number (CRN) from a string of text. The CRN consists of nine digits and an alphabet at the end.
mplt_aus_crn_standardize
DMAPPLET
Standardizes Australian Centrelink Reference Number (CRN) as indicated by the variable value in the expression.
The mapplet returns strings of numbers in the following formats:
  • No_Punctuation.
    nnnnnnnnn
  • Space.
    nnn nnn nnn
Default is No_Punctuation.
mplt_aus_crn_validate
DMAPPLET
Validates Australian Centrelink Reference Number (CRN). Returns Valid or Invalid.
mplt_aus_family_name_and_address_match
DMAPPLET
Identify duplicate rows in Australian data based on personal surname and address data grouped by the first two digits of the postcode and the surname.
mplt_aus_gender_assignment
DMAPPLET
Assigns gender according to first names. The mapplet returns "M" for male names, "F" for female names, and "U" if the gender is unknown. For example, the mapplet assigns the name "John Smith" a gender of "M" for male.
mplt_aus_given_name_standardize
DMAPPLET
Generates given names from Australian nicknames. For example, the mapplet standardizes the nickname "Bob" to the given name "Robert."
mplt_aus_healthcare_identifier_parse
DMAPPLET
Parses a 16-digit string as an Australian Health Identifier from a longer string.
The Health Identifier contains the following identifiers:
  • IHI. Individual Healthcare Identifier, beginning with 800360.
  • HPI-I. Healthcare Provider Identifier, beginning with 800361.
  • HPI-O. Healthcare Provider Identifier, beginning with 800362.
The mapplet parses strings of numbers in the following format:
  • 800360/1/2 NNNNNNNNNN
mplt_aus_healthcare_identifier_standardize
DMAPPLET
Standardizes Australian Healthcare Identifiers as indicated by the variable value in the expression.
The Health Identifier contains the following identifiers:
  • IHI. Individual Healthcare Identifier.
  • HPI-I. Healthcare Provider Identifier.
  • HPI-O. Healthcare Provider Identifier.
The mapplet returns strings of numbers in the following formats:
  • No_Punctuation.
    nnnnnnnnnnnnnnn
  • Space.
    nnnn nnnn nnnn nnnn
  • Dash.
    nnnn-nnnn-nnnn-nnnn
mplt_aus_healthcare_identifier_validate
DMAPPLET
Validates a 16-digit string as an Australian Health Identifier after checking the length and the format.
The Health Identifier contains the following identifiers:
  • IHI. Individual Healthcare Identifier.
  • HPI-I. Healthcare Provider Identifier.
  • HPI-O. Healthcare Provider Identifier.
The mapplet validates numbers in the following formats:
  • 1111-1111-1111-1111
  • 1111 1111 1111 1111
  • 1111111111111111
mplt_aus_individual_name_and_address_match
DMAPPLET
Identifies duplicate rows based on person names and Australian address data. The mapplet generates group keys from the name data.
mplt_aus_individual_name_and_email_match
DMAPPLET
Identifies duplicate rows based on email addresses and Australian person names. The mapplet generates group keys from the email address data.
mplt_aus_individual_name_and_tfn_match
DMAPPLET
Identifies duplicate rows for Australian data based on Tax File Numbers (TFN) and person names. The mapplet generates group keys from the TFN data.
mplt_aus_individual_name_date_match
DMAPPLET
Identifies duplicate rows based on Australian person names and dates. The mapplet generates group keys from the date data.
mplt_aus_individual_name_match
DMAPPLET
Identifies duplicate rows based on Australian person names. The mapplet generates group keys from the surname values.
mplt_aus_individual_name_phone_match
DMAPPLET
Identifies duplicate rows based on Australian person names and telephone numbers. The mapplet generates group keys from the telephone number data.
mplt_aus_medicare_number_parse
DMAPPLET
Parses an Australian Medicare number from a string of text into its own field. For example, parses
2428778
from
test2428 77813 2-1test
.
mplt_aus_medicare_number_standardize
DMAPPLET
Standardizes a 10-digit or 11-digit string as an Australian Medicare number, regardless of spacing or punctuation.
If input value is not equal to 11-digit, the mapplet returns the input value.
The mapplet returns strings of numbers in the following format:
  • nnnn nnnnn n
mplt_aus_medicare_number_validate
DMAPPLET
Validates a 11-digit number as an Australian Medicare number. Checks the format and length of Australian Medicare number. Checksum validation included.
mplt_aus_personal_name_and_data_match
DMAPPLET
Identifies duplicate rows in Australian data based on person names and personal data. The fields in the personal data column must contain a single type of data, such as a telephone number, email, or Tax File Number. The mapplet generates group keys from the personal data.
mplt_aus_phone_number_parse
DMAPPLET
Parses an Australian telephone number from a string. For example, parses
642123456
from test
642123456test.
mplt_aus_phone_number_standardize
DMAPPLET
Standardizes the format of an Australian phone number country code, area code and landline number to the following formats:
  • Standard.
    +nn (nn) nnnnn
  • Dashes.
    +nn-nnnn-nnnn
  • No spaces.
    +nnnnnnnnnn
mplt_aus_phone_number_validate
DMAPPLET
Validates the area code and length of Australian telephone numbers. The mapplet returns the region of the telephone number as well as codes that indicate if the area code and length of a telephone number is valid.
mplt_aus_tax_file_number_parse
DMAPPLET
Parses a 9-digit string as an Australian Tax File Number (TFN) from a string of text, for example
123456789
or
113-456-789
mplt_aus_tax_file_number_standardize
DMAPPLET
Standardizes Australian Tax File Numbers (TFN).
The mapplet returns strings of numbers in the following formats:
  • No_Punctuation.
    nnnnnnnnn
  • Space.
    nnn nnn nnn
  • Dash.
    nnn-nnn-nnn
mplt_aus_tax_file_number_validate
DMAPPLET
Validates Australia Tax File Number (TFN). For example,
289898765
or
234-345-654
mplt_AV6_Assign_DQ_Status_Code_Description
DMAPPLET
Describes the Match Score value that a verifier asset returns for the Address Verification version 6 engine.
mplt_Luhn_Algorithm
DMAPPLET
Applies the Luhn algorithm, a mod 10 calculation commonly used for credit card number check digit validation, to an input value. Returns Valid if the supplied check digit matches the calculated value. Returns Invalid if the supplied check digit does not match the calculated value.
mplt_nzl_bank_account_number_standardize
DMAPPLET
Standardizes New Zealand bank account number. The domestic account numbers include the 4-digit branch number, 7-digit account, and 3-digit suffix, in the following format:
NN-NNNN-NNNNNNN-NNN
.
mplt_nzl_bank_account_number_validate
DMAPPLET
Parses a 14-digit to 16-digit string as a New Zealand Bank account number or BSB number from a longer string. Checks the format and length of New Zealand Bank account number or BSB number.
The mapplet parses strings of numbers in the following format:
  • 882896855917503
  • 23 2042 5076936 050
  • 04-49313241531-00
mplt_nzl_ird_number_parse
DMAPPLET
Parses 9-digit numeric strings as New Zealand Inland Revenue Department numbers (IRD) from a string of text.
mplt_nzl_ird_number_standardize
DMAPPLET
Standardizes New Zealand Inland Revenue Dept. number (IRD) as indicated by the variable value in the expression.
The mapplet parses strings of numbers in the following format:
  • No_Punctuation.
    nnnnnnnnn
  • Space.
    nnn nnn nnn
  • Dash.
    nnn-nnn-nnn
Default is No_Punctuation.
mplt_nzl_ird_number_validate
DMAPPLET
Validates New Zealand Inland Revenue Department numbers (IRD) number. Returns Valid or Invalid.
mplt_nzl_phone_number_parse
DMAPPLET
Parses a New Zealand telephone number from a string. For example, parses
642123456
from test
642123456test.
mplt_nzl_phone_number_standardize
DMAPPLET
Standardizes the format of a New Zealand phone number country code, area code and landline number to the following formats:
  • Standard.
    +nn (nnn) nnnnn
  • Dashes.
    +nn-nnnn-nnnn
  • No spaces.
    +nnnnnnnnnn
mplt_nzl_phone_number_validate
DMAPPLET
Validates the area code and length of New Zealand telephone numbers. The mapplet returns the region of the telephone number as well as codes that indicate if the area code and length of a telephone number is valid.
mplt_nzl_social_welfare_number_validate
DMAPPLET
Validates a New Zealand Social Welfare Number from a string with limited punctuation. Checks the length and format of the number. No checksum validation as information on checksum is not available.
mplt_Remove_Non_Numbers
DMAPPLET
Removes all characters that are not a digit or number from an input string.
mplt_Remove_Punctuation
DMAPPLET
Removes all punctuation symbols from the input.
mplt_Remove_Punctuation_and_Space
DMAPPLET
Removes all punctuation symbols and character spaces from an input field, and returns the remaining digits and letters.
mplt_Replace_Limited_Punct_with_Space
DMAPPLET
Replaces punctuation symbols including a forward slash, backslash, exclamation point, period, or underscore character with a character space. Also replaces instances of two, three, or four consecutive character spaces with a single space.
mplt_UpperCase
DMAPPLET
Changes characters to uppercase.
nzl_tel_area_codes_infa
DICTIONARY
Contains a list of New Zealand telephone area codes.
p_assign_prename_gender
PARSE
Assigns gender depending on honorifics.
p_aus_area_code
PARSE
Parses an Australian area code.
p_aus_assign_midname_gender
PARSE
Parses the gender for the assigned middle name.
p_aus_bank_acct
PARSE
Parses an Australian Bank account number.
p_aus_bsb_number
PARSE
Parses an Australian BSB (Bank-State-Branch) Number.
p_aus_company_number
PARSE
Parses an Australian Company Number (ACN) from a string.
p_aus_crn
PARSE
Parses an Australian Customer Reference Number (CRN), which consists of nine digits and an alphabet at the end.
p_aus_health_id
PARSE
Parses a 16-digit IHI number.
p_aus_int_country_code
PARSE
Parses the Australia international dialing country codes.
p_aus_medicare_no
PARSE
Parses an Australian Medicare number from a string.
p_aus_phone
PARSE
Parses an Australian phone number.
p_nzl_area_codes
PARSE
Parses specific telephone codes.
p_nzl_int_country_code
PARSE
Parses a New Zealand country code.
p_nzl_phone
PARSE
Parses a New Zealand phone number with the phone code.
p_nzl_phone_number
PARSE
Parses a New Zealand phone number.
p_nzl_phone_parse
PARSE
Parses a New Zealand phone number from a string.
p_parse_abn
PARSE
Parses an Australian Business number from a string of text.
p_parse_aus_covid_number
PARSE
Parses an Australian COVID number.
p_parse_aus_tfn
PARSE
Parses an Australian tax file number (TFN).
p_parse_full_name
PARSE
Parses using the pre-built Australia Multiname parsing.
p_parse_nzl_ird
PARSE
Parses an 8-digit and 9-digit New Zealand Inland Revenue Dept. (IRD) number from a string of text.
v_aus_address_validation_hybrid
VERIFIER
Verifies Australian addresses with input fields that map to the Hybrid model.

0 COMMENTS

We’d like to hear from you!