Table of Contents

Search

  1. Preface
  2. Part 1: Hadoop Integration
  3. Part 2: Databricks Integration
  4. Appendix A: Connections Reference

Set the Locale for Cloudera CDH 6.x

Set the Locale for Cloudera CDH 6.x

If you want to process data that contains non-ASCII characters, you must integrate the locale setting on Data Engineering Integration with the locale setting on the cluster.
Perform this task in the following situations:
  • You are integrating with a Cloudera CDH 6.x cluster for the first time.
To integrate the locale setting, complete the following tasks:
  1. In the Hadoop connection, navigate to
    Hadoop Cluster Properties
    . As the value for the property
    Cluster Environment Variables
    , configure the locale environment variables, such as the LANG or LC_ALL environment variable.
    The locale setting in the Hadoop connection must match the locale setting that is configured in the domain.
  2. In Cloudera Manager, add the environment variables to the following YARN property:
    yarn.nodemanager.env-whitelist