Table of Contents

Search

  1. Preface
  2. Introduction to Accelerators
  3. Core Accelerator
  4. Data Domains Accelerator
  5. Australia/New Zealand Accelerator
  6. BCBS 239/CCAR Accelerator
  7. Brazil Accelerator
  8. Financial Services Accelerator
  9. France Accelerator
  10. Germany Accelerator
  11. India Accelerator
  12. Italy Accelerator
  13. Portugal Accelerator
  14. Spain Accelerator
  15. United Kingdom Accelerator
  16. U.S./Canada Accelerator

Accelerator Guide

Accelerator Guide

Data Rules in the Data Domains Accelerator

Data Rules in the Data Domains Accelerator

Use the data domain data rules to identify columns that contain data that matches the rule criteria.
Find the data rules in the following repository location:
[Informatica_DQ_Content]\Domain_Discovery\Data_Rules
The following table describes the data rules in the Data Domains accelerator:
Name
Description
dataDomain_DataRule_ABARoutingNumber
Identifies column data that matches the format of an American Banking Association routing number. The routing number identifies a financial institution in a financial transaction.
dataDomain_DataRule_Account_Status
Identifies column data that matches account status values in the reference data.
dataDomain_DataRule_Address_Data
Identifies column data that represents address information. The rule recognizes address data from multiple countries globally.
dataDomain_DataRule_Age
Identifies column data with values from 1 through 120.
dataDomain_DataRule_Alphanumeric_SpecialCharacter
Identifies column data that contains unformatted alphanumeric data and special-character data.
dataDomain_DataRule_Amount
Identifies column data that represents a physical quantity.
dataDomain_DataRule_AUT_NATID
Identifies column data that matches the Austrian national ID format.
dataDomain_DataRule_BankAccount_USA
Identifies column data that matches a bank account number format in the United States.
dataDomain_DataRule_BGR_NATID
Identifies column data that matches the Bulgarian national ID format.
dataDomain_DataRule_BIC_SWIFTCode
Identifies column data that matches Bank Identifier Code (BIC) or Society for Worldwide Interbank Financial Telecommunication (SWIFT) code by pattern recognition and country code.
dataDomain_DataRule_BinaryValues
Identifies column data that contains binary values.
dataDomain_DataRule_BirthDay
Identifies column data that matches valid birth dates. The rule verifies the number of years between the input date and current date. The rule returns "Adult," "Minor," or "Valid" based on the values from 1 through 120. The rule returns "Invalid" for all other values.
dataDomain_DataRule_BRA_IDDoc
Identifies column data that matches the number format of the
Registro Geral
ID card in Brazil.
dataDomain_DataRule_BRA_Personal_ID
Identifies column data that matches the Brazilian personal ID format.
dataDomain_DataRule_CAN_SIN
Identifies column data that matches the Social Insurance number format in Canada.
dataDomain_DataRule_CHN_NATID
Identifies column data that matches the Chinese national ID format.
dataDomain_DataRule_City
Identifies column data that contains a valid city name. The rule reads reference data that contains international city names.
dataDomain_DataRule_CompanyName
Identifies column data that matches the organization-name values in the reference data.
dataDomain_DataRule_Computer_Address
Identifies column data that matches the format of IP addresses and MAC addresses.
dataDomain_DataRule_Country
Identifies column data that matches an ISO country name.
dataDomain_DataRule_CountryCode_Phone
Identifies column data that matches phone numbers based on international dialing codes.
dataDomain_DataRule_County
Identifies column data that matches a United States county name.
dataDomain_DataRule_CreditCard_AMEX
Identifies column data that matches the American Express credit card number format.
dataDomain_DataRule_CreditCard_DinersCard
Identifies column data that matches the Diners Club International credit card number format.
dataDomain_DataRule_CreditCard_DiscoverCard
Identifies column data that matches the Discover credit card number format.
dataDomain_DataRule_CreditCard_JCB
Identifies column data that matches the JCB International credit card number format.
dataDomain_DataRule_CreditCard_MasterCard
Identifies column data that matches the MasterCard credit card number format.
dataDomain_DataRule_CreditCard_Visa
Identifies column data that matches the Visa credit card number format.
dataDomain_DataRule_CreditCardNumber
Identifies column data that matches the credit card number format of major credit card organizations, such as American Express, Diners Club International, and Maestro.
dataDomain_DataRule_CreditCardTrack1FormatB
Identifies column data that matches Track 1 Format B credit card information.
dataDomain_DataRule_Currency
Identifies column data that matches a currency term in the reference data.
dataDomain_DataRule_Date_Validation
Identifies the date strings in the source data that appear in a single format in a date column. To configure the date format that the rule uses for validation, open the dq_ValidateDate Expression transformation in the rule and update the In_Date_Format expression variable. The default format is "MM/DD/YYYY." The rule returns "Valid" or "Invalid."
dataDomain_DataRule_Date_Validation_All_Formats
Identifies the date values in the column data and standardizes the column data to a single date format.
dataDomain_DataRule_DEU_Machine_Readable_Passport
Identifies column data that matches the machine-readable German passport number format.
dataDomain_DataRule_DNK_NATID
Identifies column data that matches the Danish national ID format.
dataDomain_DataRule_DriversLicense
Identifies column data that matches Canada, United Kingdom, and Unites States driver license numbers based on the length and pattern of the data values.
dataDomain_DataRule_DriversLicense_Canada
Identifies column data that matches Canada driver license numbers except for numbers from the provinces of British Columbia, Quebec, Manitoba, and Prince Edward Island.
dataDomain_DataRule_DriversLicense_Canada_narrow
Identifies column data that matches Canada driver license numbers except for numbers from the provinces of British Columbia, Quebec, Manitoba, and Prince Edward Island.
The rule is similar to the dataDomain_DataRule_DriversLicense_Canada rule. However, dataDomain_DataRule_DriversLicense_Canada_narrow performs a more narrow analysis to reduce the likelihood of false positives.
dataDomain_DataRule_DriversLicense_GBR
Identifies column data that matches United Kingdom driver license numbers.
dataDomain_DataRule_DriversLicense_narrow
Identifies column data that matches driver license numbers from the United Kingdom and from many states and provinces in Canada and the United States.
The rule does not validate numbers from the provinces of British Columbia, Quebec, Manitoba, and Prince Edward Island.
To reduce the likelihood of false positives, the rule does not validate numbers that contain between four and eight digits.
dataDomain_DataRule_DriversLicense_USA
Identifies column data that matches the driver license numbers of most of the states in the United States.
dataDomain_DataRule_DriversLicense_USA _narrow
Identifies column data that matches the driver license numbers of most of the states in the United States.
To reduce the likelihood of false positives, the rule excludes data values that comprise between six and eight digits. For example, the rule excludes a value such as 01012017.
dataDomain_DataRule_Email
Identifies column data that matches a predefined email ID format.
dataDomain_DataRule_ExpirationDate
Identifies column data that matches expired credit card dates. The rule compares the input date to the system date for validation.
dataDomain_DataRule_FIN_NATID
Identifies column data that matches the Finnish national ID format.
dataDomain_DataRule_FirstName
Identifies column data that matches values in a reference data set of first names.
dataDomain_DataRule_FRA_INSEE
Identifies column data that matches the French Institut National de la Statistique et des Études Économiques (INSEE) number format.
dataDomain_DataRule_FullName
Identifies the strings in a column of data that contain first, middle, and last names. The rule compares the words in each string to the reference data.
dataDomain_DataRule_GBR_NINO
Identifies column data that matches the United Kingdom National Insurance number format.
dataDomain_DataRule_GBR_Passport_Number
Identifies column data that matches the United Kingdom passport number format.
dataDomain_DataRule_Gender
Identifies column data that matches the gender values in the reference data.
dataDomain_DataRule_Height
Identifies column data with values 1 through 8, where 8 represents feet in height.
dataDomain_DataRule_HostName
Identifies column data that matches valid host names.
dataDomain_DataRule_HRV_NATID
Identifies column data that matches the Croatian national ID format.
dataDomain_DataRule_IBAN
Identifies column data that matches the International Bank Account Number format for multiple European countries.
dataDomain_DataRule_ICD_10
Identifies column data that matches the names of conditions in the tenth revision of the International Statistical Classification of Diseases and Related Health Problems (ICD). The World Health Organization (WHO) maintains the classification.
dataDomain_DataRule_ICD_9
Identifies column data that matches the names of conditions in the ninth revision of the International Statistical Classification of Diseases and Related Health Problems (ICD). The World Health Organization (WHO) maintains the classification.
dataDomain_DataRule_IND_NATID
Identifies column data that matches the Indian Permanent Account Number format.
dataDomain_DataRule_IND_Passport
Identifies column data that matches the Indian passport number format.
dataDomain_DataRule_IPAddress
Identifies column data that matches a predefined IP address format.
dataDomain_DataRule_ISBN
Identifies column data that matches the International Standard Book Number format.
dataDomain_DataRule_ISIN
Identifies column data that matches the international securities identification number (ISIN) format. An ISIN uniquely identifies a security such as a stock or a bond.
dataDomain_DataRule_ItalyFiscalCode
Identifies column data that matches the Italian national ID format.
dataDomain_DataRule_ITIN_USA
Identifies column data that matches the format of an Individual Taxpayer Identification Number (ITIN) in the United States. The Internal Revenue Service issues the identification numbers.
dataDomain_DataRule_JobPosition
Identifies column data that matches the job position names in the reference data.
dataDomain_DataRule_KOR_NATID
Identifies column data that matches the Korean national ID format.
dataDomain_DataRule_LastName
Identifies column data that matches values in a reference data set of last names.
dataDomain_DataRule_Latitude
Identifies column data that matches valid latitude coordinates.
dataDomain_DataRule_LatitudeLongitude
Identifies column data that matches valid pairs of latitude and longitude coordinates, where each pair is separated by a semicolon.
dataDomain_DataRule_Longitude
Identifies column data that matches valid longitude coordinates.
dataDomain_DataRule_Machine_Readable_Passport
Identifies column data that matches machine-readable passport numbers from all countries.
dataDomain_DataRule_NDC_USA
Identifies column data that matches a National Drug Code (NDC) value in the National Drug Code directory in the United States. Each code uniquely identifies a drug that a manufacturer developed for human use.
dataDomain_DataRule_NOR_NATID
Identifies column data that matches the Norwegian national ID format.
dataDomain_DataRule_NPI_USA
Identifies column data that matches a National Provider Identifier (NPI) number in the United States. The Centers for Medicare and Medicaid Services issue the numbers to healthcare providers.
dataDomain_DataRule_PhoneNumber
Identifies column data that matches the United States phone number format.
dataDomain_DataRule_PostCode
Identifies column data that matches the postal codes of multiple countries.
dataDomain_DataRule_Quantity
Identifies column data that describes a physical quantity and includes units of measurement.
dataDomain_DataRule_Race
Identifies column data that matches the name of a race of people in the reference data.
dataDomain_DataRule_Religion
Identifies column data that matches the name of a religion in the reference data.
dataDomain_DataRule_ROU_NATID
Identifies column data that matches the Romanian national ID format.
dataDomain_DataRule_SouthAfrica_NATID
Identifies column data that matches the South African national ID format.
dataDomain_DataRule_Spanish_NIF
Identifies column data that matches the format of the fiscal identification number (NIF) in Spain.
dataDomain_DataRule_SSN
Identifies column data that matches the United States Social Security number format.
dataDomain_DataRule_State
Identifies column data that matches the state names in the United States.
dataDomain_DataRule_Street
Identifies the strings in the column data that describe street address information, for example street, road, avenue. The rule uses a regular expression to find street descriptors in the column data.
dataDomain_DataRule_SWE_NATID
Identifies column data that matches the Swedish national ID format.
dataDomain_DataRule_TWN_NATID
Identifies column data that matches the Taiwanese national ID format.
dataDomain_DataRule_UPC
Identifies column data that matches a valid Universal Product Code. A Universal Product Code is a type of barcode.
dataDomain_DataRule_UPC_EAN
Identifies column data that matches a valid Universal Product Code or European Article Number. Universal Product Codes and European Article Numbers are types of barcode.
dataDomain_DataRule_URL
Identifies column data that matches predefined URL formats.
dataDomain_DataRule_US_Zip5
Identifies column data that matches United States ZIP Codes.
dataDomain_DataRule_USA_Machine_Readable_Passport
Identifies column data that matches a machine-readable United States passport number format.
dataDomain_DataRule_USA_SSN_post_2011June
Identifies column data that matches the Social Security number format in length, numeric values, and minimum and maximum values of the area, group, and serial number sections. Based on the SSN Randomization initiative, effective June 25, 2011, the rule does not verify the issuance of a Social Security number and the group and area number combination.
dataDomain_DataRule_Weight
Identifies column data that describes a weight value. The rule checks for a number between 0 and 500.
dataDomain_DataRule_ZipCode
Identifies column data that matches United States ZIP Codes.

0 COMMENTS

We’d like to hear from you!