Table of Contents


  1. Preface
  2. Part 1: Version 10.4.0
  3. Part 2: Version 10.2.2
  4. Part 3: Version 10.2.1
  5. Part 4: Version 10.2
  6. Part 5: Version 10.1.1
  7. Part 6: Version 10.1

Data Integration Service Properties for Hadoop Integration

Data Integration Service Properties for Hadoop Integration

Effective in version 10.2, the Data Integration Service added properties required to integrate the domain with the Hadoop environment.
The following table describes the new properties:
Hadoop Staging Directory
The HDFS directory where the Data Integration Services pushes Informatica Hadoop binaries and stores temporary files during processing. Default is /tmp.
Hadoop Staging User
Required if the Data Integration Service user is empty. The HDFS user that performs operations on the Hadoop staging directory. The user needs write permissions on Hadoop staging directory. Default is the Data Integration Service user.
Custom Hadoop OS Path
The local path to the Informatica Hadoop binaries compatible with the Hadoop operating system. Required when the Hadoop cluster and the Data Integration Service are on different supported operating systems.
Download and extract the Informatica binaries for the Hadoop cluster on the machine that hosts the Data Integration Service. The Data Integration Service uses the binaries in this directory to integrate the domain with the Hadoop cluster.
The Data Integration Service can synchronize the following operating systems:

    SUSE 11 and Redhat 6.5

Changes take effect after you recycle the Data Integration Service.
As a result of the changes in cluster integration, the following properties are removed from the Data Integration Service:
  • Informatica Home Directory on Hadoop
  • Hadoop Distribution Directory
For more information, see the
Informatica 10.2 Hadoop Integration Guide


We’d like to hear from you!