Configure Stop Words, Synonyms, and Character Mappings
When you perform a search, MDM can ignore common words such as "and", "an", and "is". MDM can also search for synonyms of the search string. For example, when you search for "William", the search result can include the synonyms "Will" and "Willy".
Informatica provides text files that contain stop words and synonyms. Use these files to configure the common words to ignore and the synonyms to include in search results, or create your own files.
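The effect of stop words and synonyms on a search string can be pictured with a minimal Python sketch. It is an illustration only, not how MDM or Elasticsearch implements analysis: the word lists and the expand_query function are hypothetical, and the lists are not the contents of the stopwords.txt or synonyms.txt files.

```python
# Illustrative only: hypothetical word lists, not the contents of the
# Informatica-provided stopwords.txt or synonyms.txt files.
STOP_WORDS = {"and", "an", "is"}
SYNONYMS = {"william": ["will", "willy"]}


def expand_query(query: str) -> list[str]:
    """Drop stop words, then add the synonyms of each remaining term."""
    terms = []
    for token in query.lower().split():
        if token in STOP_WORDS:
            continue  # common word: ignored by the search
        terms.append(token)
        terms.extend(SYNONYMS.get(token, []))  # no entry means no synonyms
    return terms


print(expand_query("William and Mary"))
# ['william', 'will', 'willy', 'mary']
```

In this sketch, "and" is dropped as a stop word and "William" is expanded to include "Will" and "Willy", which mirrors the example above.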
To use the default Elasticsearch analyzers for languages such as Chinese, Japanese, and Korean, Informatica provides a mappings file, mapping-FoldToASCII.txt. The character filter of these default analyzers uses the mappings file to convert alphabetic, numeric, and symbolic characters that are not in the Basic Latin Unicode block to their ASCII equivalents.
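As a rough illustration of what such a character mapping does, the following Python sketch folds a few non-ASCII characters to ASCII. The table is a hypothetical four-entry excerpt chosen for this example. It does not reproduce the contents or the file format of mapping-FoldToASCII.txt, and the fold_to_ascii function is not part of MDM or Elasticsearch.

```python
# Hypothetical excerpt of a character mapping table. The real
# mapping-FoldToASCII.txt file is not reproduced here.
FOLD_TO_ASCII = {
    "\uff21": "A",  # fullwidth Latin capital letter A
    "\uff11": "1",  # fullwidth digit one
    "\u00e9": "e",  # Latin small letter e with acute
    "\uff06": "&",  # fullwidth ampersand
}


def fold_to_ascii(text: str) -> str:
    """Replace each mapped character; leave all other characters unchanged."""
    return text.translate(str.maketrans(FOLD_TO_ASCII))


print(fold_to_ascii("\uff21\uff11 caf\u00e9 \uff06 co"))
# A1 cafe & co
```

Characters without an entry in the table pass through unchanged, so plain ASCII input is returned as is.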
To get the stopwords.txt, synonyms.txt, stopwords_ja.txt, and mapping-FoldToASCII.txt files, contact Informatica Global Customer Support.
To configure stop words, synonyms, and character mappings, perform the following steps:
Create an analysis directory in the following location:
<elasticsearch installation directory>/config
Copy the stopwords.txt and synonyms.txt files to the analysis directory.
To configure stop words for languages such as Japanese, create a