Prepare the Archive File for Amazon EMR

After you verify property values in the *-site.xml files, create a .zip or a .tar file that the Informatica administrator can use to import the cluster configuration into the domain.
Create an archive file that contains the following files from the cluster:
  • core-site.xml
  • hbase-site.xml. Required only if you access HBase sources and targets.
  • hdfs-site.xml
  • hive-site.xml
  • mapred-site.xml or tez-site.xml. Include whichever file matches the Hive execution type used on the Hadoop cluster.
  • yarn-site.xml
To import from Amazon EMR, the Informatica administrator must use an archive file.
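
The file selection and packaging described above can be sketched in Python. This is a minimal illustration, not an Informatica utility. It assumes the *-site.xml files have already been copied from the cluster into a single staging directory. The function names, the `hive_engine` and `include_hbase` parameters, and the choice to place the files at the top level of the archive are all assumptions made for the example.

```python
import zipfile
from pathlib import Path

# Files the list above names as always required.
BASE_FILES = ["core-site.xml", "hdfs-site.xml", "hive-site.xml", "yarn-site.xml"]


def select_site_files(hive_engine="tez", include_hbase=False):
    """Return the *-site.xml file names to archive (hypothetical helper)."""
    if hive_engine not in ("tez", "mr"):
        raise ValueError("hive_engine must be 'tez' or 'mr'")
    names = list(BASE_FILES)
    # Pick one file based on the Hive execution type used on the cluster.
    names.append("tez-site.xml" if hive_engine == "tez" else "mapred-site.xml")
    # hbase-site.xml is needed only when HBase sources or targets are accessed.
    if include_hbase:
        names.append("hbase-site.xml")
    return sorted(names)


def build_archive(conf_dir, archive_path, hive_engine="tez", include_hbase=False):
    """Write the selected files into a .zip archive and return their names."""
    conf_dir = Path(conf_dir)
    names = select_site_files(hive_engine, include_hbase)
    # Fail early if any expected file is absent from the staging directory.
    missing = [n for n in names if not (conf_dir / n).is_file()]
    if missing:
        raise FileNotFoundError(
            "missing from " + str(conf_dir) + ": " + ", ".join(missing)
        )
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for name in names:
            # Assumption: files sit at the top level of the archive.
            zf.write(conf_dir / name, arcname=name)
    return names
```

For example, `build_archive("staging", "emr_cluster_conf.zip", hive_engine="mr", include_hbase=True)` would package the four base files plus mapred-site.xml and hbase-site.xml, where `staging` and the archive name are placeholder paths. A .tar file could be produced the same way with the standard `tarfile` module.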

