Integration Guide

Configure the Files for Hive Tables on S3

To run mappings with Hive sources or targets on S3, you must configure files from the cluster master node on the Data Integration Service machine.
Perform this task in the following situations:
  • You are integrating for the first time.
  • You upgraded from any Informatica version and changed the distribution version.
You can perform one of the following steps to configure the files:
Copy the .jar file
To integrate with EMR 5.16, get emrfs-hadoop-assembly-2.25.0.jar from the Hadoop administrator. Copy the file to the following locations on each Data Integration Service machine:
  • /<Informatica installation directory>/services/shared/hadoop/EMR_<version number>/lib
  • /<Informatica installation directory>/services/shared/hadoop/EMR_<version number>/extras/hive-auxjars
If you upgraded from EMR 5.10 to EMR 5.14, the part of the file path that includes EMR_<version number> remains EMR_5.10.
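The copy step can be scripted so that both locations always receive the same file. The following is a minimal sketch, not an Informatica-provided tool: the function name copy_emrfs_jar and its parameters are hypothetical, and the caller supplies the installation directory and the EMR_<version number> directory name. It assumes both target directories already exist and stops if either is missing.

```python
import shutil
from pathlib import Path


def copy_emrfs_jar(jar_path, informatica_home, emr_dir_name):
    """Copy the EMRFS .jar into the lib and extras/hive-auxjars directories.

    All three arguments are supplied by the caller (hypothetical names);
    nothing is discovered automatically.
    """
    jar = Path(jar_path)
    distro = Path(informatica_home) / "services" / "shared" / "hadoop" / emr_dir_name
    targets = [distro / "lib", distro / "extras" / "hive-auxjars"]
    copied = []
    for target_dir in targets:
        # Fail loudly instead of creating a directory the installer should own.
        if not target_dir.is_dir():
            raise FileNotFoundError(f"Expected directory is missing: {target_dir}")
        # copy2 preserves file metadata and returns the destination path.
        copied.append(Path(shutil.copy2(jar, target_dir)))
    return copied


# Example call (paths are placeholders, adjust to your installation):
# copy_emrfs_jar("emrfs-hadoop-assembly-2.25.0.jar", "/opt/informatica", "EMR_5.10")
```

Run the script on each Data Integration Service machine, because the copy is local to the machine where it executes.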
Create a file
Create a ~/.aws/config file on the Data Integration Service machine. The file must contain the AWS location, that is, the region. For example:
[default]
region=us-west-2
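If you prefer to generate the file rather than type it, the sketch below writes the same two lines with Python's standard configparser module. The function name write_aws_config and its parameters are hypothetical. The sketch overwrites any existing file at the target path, so back up an existing configuration first.

```python
import configparser
from pathlib import Path


def write_aws_config(config_path, region):
    """Write an AWS config file whose [default] section holds the region."""
    path = Path(config_path).expanduser()
    # Create ~/.aws (or the chosen parent directory) if it does not exist yet.
    path.parent.mkdir(parents=True, exist_ok=True)
    config = configparser.ConfigParser()
    config["default"] = {"region": region}
    with path.open("w", encoding="utf-8") as handle:
        # No spaces around "=" so the output reads "region=us-west-2".
        config.write(handle, space_around_delimiters=False)
    return path


# Example call (the region is a placeholder for your own AWS region):
# write_aws_config("~/.aws/config", "us-west-2")
```

Run it as the operating system user that owns the Data Integration Service process, because ~ expands to the home directory of the user running the script.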
Create an environment variable
Create the AWS_CONFIG_FILE environment variable on the Data Integration Service machine. Set the value to:
<EMR_5.10>/conf/aws.default
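After you set the variable, a quick check can confirm that it is visible and points to a real file. The following sketch is an illustration only: the function name check_aws_config_file is hypothetical, and the check reflects the environment of the process that runs it, which may differ from the environment of the Data Integration Service if the variable was set in a different shell or profile.

```python
import os
from pathlib import Path


def check_aws_config_file(environ=None):
    """Return (ok, message) describing the state of AWS_CONFIG_FILE."""
    # Default to the current process environment; a dict can be passed for testing.
    env = os.environ if environ is None else environ
    value = env.get("AWS_CONFIG_FILE")
    if not value:
        return False, "AWS_CONFIG_FILE is not set"
    if not Path(value).is_file():
        return False, f"AWS_CONFIG_FILE points to a missing file: {value}"
    return True, f"AWS_CONFIG_FILE is set to {value}"


# Example call:
# ok, message = check_aws_config_file()
# print(message)
```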
