Through Big Data Management®, you can integrate the Informatica domain with the Hadoop environment or the Databricks environment. Supported Hadoop distributions include Amazon EMR, Azure HDInsight, Cloudera CDH, Hortonworks HDP, and MapR.
The integration process for each distribution includes the following high-level tasks:
Before-you-begin tasks
Complete these tasks to prepare the Hadoop or Databricks environment and the Informatica domain for integration.
Integration tasks
Complete these tasks to import the cluster configuration into the domain, create and configure connections, and enable mappings to run in the Hadoop or Databricks environment.
If you are integrating the domain with a Hadoop cluster for the first time, or integrating it with Azure Databricks, refer to the integration task flow diagram for the distribution. If you are upgrading from a previous version of Big Data Management in a Hadoop environment, refer to the upgrade task flow diagrams for the distribution.