Configure Stop Words, Synonyms, and Character Mappings

When you perform a search, MDM can ignore common words such as "and", "an", and "is". MDM can also search for synonyms of the search string. For example, when you search for "William", the search result can include the synonyms "Will" and "Willy".
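As a rough illustration of these two behaviors, the following Python sketch drops stop words from a query and expands the remaining terms with synonyms. The word lists and the expand_query function are hypothetical. They do not reproduce the contents of the Informatica files or the way Elasticsearch performs analysis internally.

```python
# Hypothetical word lists, for illustration only.
STOP_WORDS = {"and", "an", "is"}
SYNONYMS = {"william": ["will", "willy"]}


def expand_query(query):
    """Return search terms with stop words removed and synonyms added."""
    terms = []
    for word in query.lower().split():
        if word in STOP_WORDS:
            continue  # ignore common words
        terms.append(word)
        # Add any synonyms configured for this word.
        terms.extend(SYNONYMS.get(word, []))
    return terms


print(expand_query("William and Mary"))
# prints ['william', 'will', 'willy', 'mary']
```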
To ignore common words or to include synonyms in search results, use the stop words and synonyms text files that Informatica provides, or create your own.
To use the default Elasticsearch analyzers for languages such as Chinese, Japanese, and Korean, Informatica provides a mappings file, mapping-FoldToASCII.txt. The character filter of these default analyzers uses the mappings file to convert alphabetic, numeric, and symbolic characters that are not in the Basic Latin Unicode block to their ASCII equivalents.
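The folding that the character filter performs can be pictured as a lookup table applied character by character. The Python sketch below is only a conceptual model: the FOLD_MAP entries and the fold_to_ascii function are hypothetical, and the actual contents and syntax of mapping-FoldToASCII.txt are not shown here.

```python
# Hypothetical excerpt of a folding table; the real mappings file is
# assumed to cover far more characters.
FOLD_MAP = {
    "\u00e9": "e",   # e with acute accent
    "\u00fc": "u",   # u with diaeresis
    "\u00df": "ss",  # sharp s
    "\uff11": "1",   # fullwidth digit one
    "\uff21": "A",   # fullwidth Latin capital letter A
}


def fold_to_ascii(text):
    """Replace each mapped character and leave all others unchanged."""
    return "".join(FOLD_MAP.get(ch, ch) for ch in text)


print(fold_to_ascii("M\u00fcller \uff11\uff21"))
# prints Muller 1A
```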
To get the stopwords.txt, synonyms.txt, stopwords_ja.txt, and mapping-FoldToASCII.txt files, contact Informatica Global Customer Support.
To configure stop words, synonyms, and character mappings, perform the following steps:
  1. Create an analysis directory in the following location:
    <elasticsearch installation directory>/config
  2. Copy the stopwords.txt and synonyms.txt files to the analysis directory.
  3. To configure stop words for languages such as Japanese, create a lang directory in the following location:
    <elasticsearch installation directory>/config/analysis
  4. Copy the stop words files for other languages, such as stopwords_ja.txt, and the mapping-FoldToASCII.txt file to the lang directory.
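
The steps above can also be scripted. The following Python sketch is one possible way to do it, assuming that the four files obtained from Informatica Global Customer Support sit together in a single source directory. The install_analysis_files function and its es_home and source_dir parameters are hypothetical names chosen for this example.

```python
import shutil
from pathlib import Path

# File names taken from the steps above.
ANALYSIS_FILES = ["stopwords.txt", "synonyms.txt"]
LANG_FILES = ["stopwords_ja.txt", "mapping-FoldToASCII.txt"]


def install_analysis_files(es_home, source_dir):
    """Create the analysis and lang directories and copy the files into them.

    es_home is the Elasticsearch installation directory; source_dir is
    assumed to contain all four text files.
    """
    source_dir = Path(source_dir)
    analysis_dir = Path(es_home) / "config" / "analysis"
    lang_dir = analysis_dir / "lang"

    # Steps 1 and 3: parents=True also creates the analysis directory.
    lang_dir.mkdir(parents=True, exist_ok=True)

    # Step 2: stop words and synonyms go into config/analysis.
    for name in ANALYSIS_FILES:
        shutil.copy2(source_dir / name, analysis_dir / name)

    # Step 4: language-specific stop words and the mappings file
    # go into config/analysis/lang.
    for name in LANG_FILES:
        shutil.copy2(source_dir / name, lang_dir / name)

    return analysis_dir, lang_dir
```

Call install_analysis_files with the Elasticsearch installation directory and the directory that holds the downloaded files. The function raises FileNotFoundError if any of the four files is missing from the source directory.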
