Developer Guide (On-Premises)

5.12.0
- 5.9.0
- 5.8.1
- 5.8.0
- 5.16.4
- 5.16.3
- 5.16.2
- 5.16.1
- 5.16.0
- 5.15.0
- 5.14.0
- 5.13.0
- 5.11.0
- 5.10.0

Back Next

Transliteration

Transliteration converts data between non-Roman characters and Latin characters. Transliteration also helps you replace diacritical and extended characters with plain text equivalents.

Transliteration helps nonnative speakers find an approximate pronunciation of a word based on the pronunciation rules of their own language. Transliterations of ISO character sets use invertible mapping so that the transliteration can be reversed without any information loss. However, for other character sets such as BGN, transliteration is not reversible.

Informatica Address Verification can transliterate to and from the following writing systems:

Greek (BGN/PCGN 1962, ISO 843 – 1997)

Cyrillic (BGN/PCGN 1947, ISO 9 – 1995)

You can perform Cyrillic transliteration for Belarus, Bulgaria, Kazakhstan, Macedonia, Russia, and Ukraine.

Japanese Katakana, Hiragana, and Kanji

Chinese Pinyin (Mandarin, Cantonese)

Transliteration goes beyond character set mapping, which is a mapping between different numeric representations of a character. A language such as Japanese with the Katakana, Hiragana, and Kanji characters has no direct representation in the English language. However, each Japanese character has a certain associated sound that can be approximated using phonetic Latin characters.

Multiple transliteration schemes are available for different languages. The following table shows transliteration of characters from different languages:

Source Character Set	Input	Destination Character Set	Output
Latin	Ä	ASCII	AE
Latin	ĝ	ASCII	g
Japanese	个	Latin	ka
Cyrillic	Ж	Latin	ZH

Rename Saved Search

Table of Contents

Developer Guide (On-Premises)

Developer Guide (On-Premises)

Transliteration

Transliteration