Integrate the Domain with the Hadoop or Databricks Environment
Integrate the Domain with the Hadoop or Databricks Environment
If you imported the cluster configuration from the Hadoop or Databricks environment during installation, you must complete the integration between the domain and the Hadoop environment. Integration tasks are required in both the Hadoop environment and the Informatica domain environment.
To integrate the domain with the Hadoop environment, you complete the following high-level tasks:
Prepare directories, users, and permissions.
Configure *-site.xml files on the Hadoop or Databricks environment. The properties *-site.xml files must be updated with values required for Informatica processing in the third-party environment.
Refresh the cluster configuration in the Administrator tool. Refresh the cluster configuration to get the updated properties from the *-site.xml files on the cluster.
Update connections in the Administrator tool. Update connections if you want to use property values other than the default values. You will also need to configure environment variables in the connection properties.
For more information about how to import a Hadoop cluster configuration, see the