Installation for Enterprise Data Preparation

Prepare for Archive File Import with a Full Installation

The Hadoop administrator might choose to provide you with a .zip or .tar archive file instead of direct connection information.
If you are integrating with an Amazon EMR, MapR, or Google Dataproc cluster, you must import the cluster configuration through an archive file.
Get an archive file that contains the following *-site.xml files from the cluster:
  • core-site.xml
  • hbase-site.xml. Required only if you access HBase sources and targets.
  • hdfs-site.xml
  • hive-site.xml
  • mapred-site.xml or tez-site.xml. Include the mapred-site.xml file or the tez-site.xml file based on the Hive execution type used on the Hadoop cluster.
  • yarn-site.xml
Verify that the Hadoop administrator creates an archive file from all the listed *-site.xml files.
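The following Python sketch shows one way the archive might be assembled and checked against the list above. It assumes that the *-site.xml files have already been copied into a single local directory and that the files belong at the root of a .zip archive. The function name build_site_archive and both of its parameters are hypothetical and are not part of any Informatica or Hadoop tool.

```python
import zipfile
from pathlib import Path

# File names come from the list above. hbase-site.xml is optional, and
# only one of mapred-site.xml or tez-site.xml is expected.
REQUIRED = ["core-site.xml", "hdfs-site.xml", "hive-site.xml", "yarn-site.xml"]
OPTIONAL = ["hbase-site.xml"]
EXECUTION = ["mapred-site.xml", "tez-site.xml"]


def build_site_archive(conf_dir, archive_path):
    """Zip the *-site.xml files found in conf_dir and return the names added.

    Hypothetical helper: conf_dir is assumed to hold copies of the
    cluster's *-site.xml files in one flat directory.
    """
    conf_dir = Path(conf_dir)

    # Refuse to build an incomplete archive.
    missing = [n for n in REQUIRED if not (conf_dir / n).is_file()]
    if not any((conf_dir / n).is_file() for n in EXECUTION):
        missing.append("mapred-site.xml or tez-site.xml")
    if missing:
        raise FileNotFoundError("missing: " + ", ".join(missing))

    names = [n for n in REQUIRED + OPTIONAL + EXECUTION
             if (conf_dir / n).is_file()]
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for name in names:
            # Assumption: files are stored at the archive root.
            zf.write(conf_dir / name, arcname=name)
    return names
```

The returned list can be compared with the list above before the archive is handed over for import.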
After creating the archive file, the Hadoop administrator needs to edit it for the following distributions:
Azure HDInsight
Edit the Hortonworks Data Platform (HDP) version string wherever it appears in the archive file. Search for the string ${hdp.version} and replace all instances with the HDP version that HDInsight includes in the Hadoop distribution.
Hortonworks HDP
Edit the Hortonworks Data Platform (HDP) version string wherever it appears in the archive file. Search for the string ${hdp.version} and replace all instances with the HDP version that Hortonworks includes in the Hadoop distribution.
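The search-and-replace step for Azure HDInsight and Hortonworks HDP can be sketched in Python as follows. The sketch assumes a .zip archive whose members are UTF-8 text files and writes the result to a new archive rather than editing in place. A .tar archive would need the tarfile module instead. The function name replace_hdp_version and its parameters are hypothetical. The version value passed in must be the one that the cluster actually reports.

```python
import zipfile

PLACEHOLDER = "${hdp.version}"


def replace_hdp_version(src_zip, dst_zip, hdp_version):
    """Copy src_zip to dst_zip with every placeholder replaced.

    Hypothetical helper. Returns the number of placeholders replaced
    across all files in the archive.
    """
    replaced = 0
    with zipfile.ZipFile(src_zip) as src, \
         zipfile.ZipFile(dst_zip, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            # Assumption: every member is a UTF-8 encoded text file.
            text = src.read(info.filename).decode("utf-8")
            replaced += text.count(PLACEHOLDER)
            dst.writestr(info.filename,
                         text.replace(PLACEHOLDER, hdp_version))
    return replaced
```

A return value of zero suggests that the archive did not contain the placeholder, in which case no edit was needed.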