Table of Contents

Search

  1. Preface
  2. Part 1: Installation Getting Started
  3. Part 2: Before You Install the Services
  4. Part 3: Run the Services Installer
  5. Part 4: After You Install the Services
  6. Part 5: Informatica Client Installation
  7. Part 6: Uninstallation
  8. Appendix A: Starting and Stopping Informatica Services
  9. Appendix B: Connecting to Databases from UNIX or Linux

Installation for Enterprise Data Preparation

Installation for Enterprise Data Preparation

Integrate the Domain with the Hadoop Environment

Integrate the Domain with the Hadoop Environment

If you imported the cluster configuration from the Hadoop environment during installation, you must complete the integration between the domain and the Hadoop environment. Integration tasks are required in both the Hadoop environment and the Informatica domain environment.
For information on how to import a Hadoop cluster configuration, refer to the Cluster Configuration topic and the Hadoop Integration section of the
Data Engineering Integration
Guide.
To integrate the domain with the Hadoop environment, you complete the following high-level tasks:
  1. Prepare directories, users, and permissions.
  2. Configure *-site.xml files on the Hadoop environment. The properties *-site.xml files must be updated with values required for Informatica processing in the Hadoop environment.
  3. Refresh the cluster configuration in the Administrator tool. Refresh the cluster configuration to get the updated properties from the *-site.xml files on the cluster.
  4. Update connections in the Administrator tool. Update connections if you want to use property values other than the default values. You will also need to configure environment variables in the Hadoop connection.