Sources in a Hadoop Environment

A mapping that you push to the Hadoop environment can include a source from the native environment or from the Hadoop environment. Some sources have limitations when you reference them in the Hadoop environment.
You can run mappings with the following sources in a Hadoop environment:
  • Flat file (native)
  • HBase
  • HDFS complex file
  • HDFS flat file
  • Hive
  • IBM DB2
  • Netezza
  • ODBC
  • Oracle
  • Sqoop sources
  • Teradata
When a mapping runs in the Hadoop environment, an HDFS source or a Hive source cannot reside on a remote cluster. A remote cluster is a cluster that is remote from the machine that the Hadoop connection references in the mapping.
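
The two rules above (the list of supported sources and the remote-cluster restriction) can be illustrated with a short pre-flight check. This is a minimal sketch, not an Informatica API: the `Source` class, the `check_source` function, and the cluster identifiers are hypothetical names introduced for illustration. It also assumes that "an HDFS source" covers both the HDFS complex file and HDFS flat file entries in the list.

```python
from dataclasses import dataclass
from typing import List, Optional

# Source types that the list above names as runnable in a Hadoop environment.
HADOOP_SOURCE_TYPES = {
    "Flat file (native)", "HBase", "HDFS complex file", "HDFS flat file",
    "Hive", "IBM DB2", "Netezza", "ODBC", "Oracle", "Sqoop sources", "Teradata",
}

# Assumption: "an HDFS source or a Hive source" maps to these three entries.
CLUSTER_BOUND_TYPES = {"HDFS complex file", "HDFS flat file", "Hive"}


@dataclass
class Source:
    """Hypothetical description of a mapping source."""
    name: str
    source_type: str
    cluster: Optional[str] = None  # hypothetical cluster identifier


def check_source(source: Source, connection_cluster: str) -> List[str]:
    """Return a list of problems; an empty list means both rules are met."""
    problems = []
    if source.source_type not in HADOOP_SOURCE_TYPES:
        problems.append(
            f"{source.name}: source type {source.source_type!r} is not in the supported list"
        )
    elif (source.source_type in CLUSTER_BOUND_TYPES
          and source.cluster != connection_cluster):
        # HDFS and Hive sources must not reside on a remote cluster.
        problems.append(
            f"{source.name}: {source.source_type} source is on {source.cluster!r}, "
            f"not on the cluster of the Hadoop connection ({connection_cluster!r})"
        )
    return problems


if __name__ == "__main__":
    for src in (Source("orders", "Hive", "cluster_a"),
                Source("clicks", "HDFS flat file", "cluster_b"),
                Source("customers", "Oracle")):
        print(src.name, check_source(src, "cluster_a") or "ok")
```

The check treats the cluster rule as applying only to HDFS and Hive sources, so a relational source such as Oracle passes without a cluster identifier.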

Updated July 03, 2018