Hi, I'm Ask INFA!
What would you like to know?
ASK INFAPreview
Please to access Ask INFA.

Table of Contents

Search

  1. Preface
  2. Understanding Domains
  3. Managing Your Account
  4. Using Informatica Administrator
  5. Using the Domain View
  6. Domain Management
  7. Nodes
  8. High Availability
  9. Connections
  10. Connection Properties
  11. Domain Object Export and Import
  12. License Management
  13. Monitoring
  14. Log Management
  15. Domain Reports
  16. Understanding Globalization
  17. Appendix A: Code Pages
  18. Appendix B: Custom Roles
  19. Appendix C: Informatica Platform Connectivity
  20. Appendix D: Configure the Web Browser

Administrator Guide

Administrator Guide

Code Page Compatibility

Code Page Compatibility

Compatibility between code pages is essential for accurate data movement when the
CDI-PC Integration Service
runs in the Unicode data movement mode.
A code page can be compatible with another code page, or it can be a subset or a superset of another:
  • Compatible. Two code pages are compatible when the characters encoded in the two code pages are virtually identical. For example, JapanEUC and JIPSE code pages contain identical characters and are compatible with each other. The
    CDI-PC repository
    and
    CDI-PC Integration Service
    process can each use one of these code pages and can pass data back and forth without data loss.
  • Superset. A code page is a superset of another code page when it contains all the characters encoded in the other code page and additional characters not encoded in the other code page. For example, MS Latin1 is a superset of US-ASCII because it contains all characters in the US-ASCII code page.
    Informatica considers a code page to be a superset of itself and all other compatible code pages.
  • Subset. A code page is a subset of another code page when all characters in the code page are also encoded in the other code page. For example, US-ASCII is a subset of MS Latin1 because all characters in the US-ASCII code page are also encoded in the MS Latin1 code page.
For accurate data movement, the target code page must be a superset of the source code page. If the target code page is not a superset of the source code page, the
CDI-PC Integration Service
may not process all characters, resulting in incorrect or missing data. For example, Latin1 is a superset of US-ASCII. If you select Latin1 as the source code page and US-ASCII as the target code page, you might lose character data if the source contains characters that are not included in US-ASCII.
When you install or upgrade a
CDI-PC Integration Service
to run in Unicode mode, you must ensure code page compatibility among the domain configuration database, the Administrator tool,
CDI-PC Client
,
CDI-PC Integration Service
process nodes, the
CDI-PC repository
, the Metadata Manager repository, and the machines hosting
pmrep
and
pmcmd
. In Unicode mode, the
CDI-PC Integration Service
enforces code page compatibility between the
CDI-PC Client
and the
CDI-PC repository
, and between the
CDI-PC Integration Service
process and the
CDI-PC repository
. In addition, when you run the
CDI-PC Integration Service
in Unicode mode, code pages associated with sessions must have the appropriate relationships:
  • For each source in the session, the source code page must be a subset of the target code page. The
    CDI-PC Integration Service
    does not require code page compatibility between the source and the
    CDI-PC Integration Service
    process or between the
    CDI-PC Integration Service
    process and the target.
  • If the session contains a Lookup or Stored Procedure transformation, the database or file code page must be a subset of the target that receives data from the Lookup or Stored Procedure transformation and a superset of the source that provides data to the Lookup or Stored Procedure transformation.
  • If the session contains an External Procedure or Custom transformation, the procedure must pass data in a code page that is a subset of the target code page for targets that receive data from the External Procedure or Custom transformation.
Informatica uses code pages for the following components:
  • Domain configuration database. The domain configuration database must be compatible with the code pages of the
    CDI-PC repository
    and Metadata Manager repository.
  • Administrator tool. You can enter data in any language in the Administrator tool.
  • CDI-PC Client
    . You can enter metadata in any language in the
    CDI-PC Client
    .
  • CDI-PC Integration Service
    process. The
    CDI-PC Integration Service
    can move data in ASCII mode and Unicode mode. The default data movement mode is ASCII, which passes 7-bit ASCII or 8-bit ASCII character data. To pass multibyte character data from sources to targets, use the Unicode data movement mode. When you run the
    CDI-PC Integration Service
    in Unicode mode, it uses up to three bytes for each character to move data and performs additional checks at the session level to ensure data integrity.
  • CDI-PC repository
    . The
    CDI-PC repository
    can store data in any language. You can use the UTF-8 code page for the
    CDI-PC repository
    to store multibyte data in the
    CDI-PC repository
    . The code page for the
    CDI-PC repository
    is the same as the database code page.
  • Sources and targets. The sources and targets store data in one or more languages. You use code pages to specify the type of characters in the sources and targets.
  • CDI-PC
    command line programs. You must also ensure that the code page for
    pmrep
    is a subset of the
    CDI-PC repository
    code page and the code page for
    pmcmd
    is a subset of the
    CDI-PC Integration Service
    process code page.
Most database servers use two code pages, a client code page to receive data from client applications and a server code page to store the data. When the database server is running, it converts data between the two code pages if they are different. In this type of database configuration, the
CDI-PC Integration Service
process interacts with the database client code page. Thus, code pages used by the
CDI-PC Integration Service
process, such as the
CDI-PC repository
, source, or target code pages, must be identical to the database client code page. The database client code page is usually identical to the operating system code page on which the
CDI-PC Integration Service
process runs. The database client code page is a subset of the database server code page.
For more information about specific database client and server code pages, see your database documentation.

0 COMMENTS

We’d like to hear from you!