Table of Contents

Search

  1. Preface
  2. Getting started with Informatica Address Verification (On-Premises)
  3. General Settings
  4. Input Parameters
  5. Process Parameters
  6. Address Enrichments
  7. Result Parameters
  8. Output Fields
  9. Assessment Codes and Return Codes
  10. Frequently Asked Questions
  11. Geocode Countries

Developer Guide (On-Premises)

Developer Guide (On-Premises)

Transliteration

Transliteration

Transliteration converts data between non-Roman characters and Latin characters. Transliteration also helps you replace diacritical and extended characters with plain text equivalents.
Transliteration helps nonnative speakers find an approximate pronunciation of a word based on the pronunciation rules of their own language. Transliterations of ISO character sets use invertible mapping so that the transliteration can be reversed without any information loss. However, for other character sets such as BGN, transliteration is not reversible.
Informatica Address Verification can transliterate to and from the following writing systems:
  • Greek (BGN/PCGN 1962, ISO 843 – 1997)
  • Cyrillic (BGN/PCGN 1947, ISO 9 – 1995)
    You can perform Cyrillic transliteration for Belarus, Bulgaria, Kazakhstan, Macedonia, Russia, and Ukraine.
  • Japanese Katakana, Hiragana, and Kanji
  • Chinese Pinyin (Mandarin, Cantonese)
Transliteration goes beyond character set mapping, which is a mapping between different numeric representations of a character. A language such as Japanese with the Katakana, Hiragana, and Kanji characters has no direct representation in the English language. However, each Japanese character has a certain associated sound that can be approximated using phonetic Latin characters.
Multiple transliteration schemes are available for different languages. The following table shows transliteration of characters from different languages:
Source Character Set
Input
Destination Character Set
Output
Latin
Ä
ASCII
AE
Latin
ĝ
ASCII
g
Japanese
Latin
ka
Cyrillic
Ж
Latin
ZH