Prepare to Create the Enterprise Data Lake Services
Before you can create the Enterprise Data Lake services, the domain must be integrated with the Hadoop environment through a cluster configuration object in the domain.
The Enterprise Data Lake services require connections to the Hadoop environment. The connections are associated with the Hadoop environment through a cluster configuration. The process to integrate the environments and create the services can vary based on the type of installation you choose.
Install Enterprise Data Lake with Informatica Domain Services
If you install the Informatica domain services together with Enterprise Data Lake and you want the installer to create the Enterprise Data Lake services, you must provide the cluster information during the installation. The installer can import the cluster configuration from the Hadoop environment and create the connections that the Enterprise Data Lake services require.
Before you run the installer, you need to get import information from the Hadoop administrator. The Hadoop administrator can provide import information to you in one of the following formats:
Cluster authentication information. The Hadoop administrator can provide you with cluster authentication information to connect to the cluster for the import process.
Archive file. The Hadoop administrator can provide you with an archive file that contains properties from the *-site.xml files on the cluster. If you import from Amazon EMR or MapR, you can import only from an archive file.
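As an illustration of the archive file option, the following Python sketch shows one way a Hadoop administrator might collect the *-site.xml files from a cluster configuration directory into a single archive. It is a minimal sketch under stated assumptions, not the documented procedure: the function name archive_site_files, the ZIP format, the flat layout inside the archive, and the example directory /etc/hadoop/conf are all hypothetical. Confirm the archive format and the list of files that the installer expects before you rely on it.

```python
import glob
import os
import zipfile


def archive_site_files(conf_dir, archive_path):
    """Bundle every *-site.xml file found directly in conf_dir into a ZIP archive.

    Hypothetical helper for illustration. Returns the sorted list of
    file names that were added to the archive.
    """
    pattern = os.path.join(conf_dir, "*-site.xml")
    site_files = sorted(glob.glob(pattern))
    if not site_files:
        raise FileNotFoundError(f"no *-site.xml files found in {conf_dir}")

    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in site_files:
            # Assumption: files are stored at the archive root, without
            # their directory components.
            archive.write(path, arcname=os.path.basename(path))

    return [os.path.basename(path) for path in site_files]


# Example call (both paths are assumptions, adjust them for your cluster):
# archive_site_files("/etc/hadoop/conf", "cluster_conf.zip")
```

The helper only matches files in the top level of the directory and ignores everything that does not end in -site.xml, so unrelated files such as logging properties stay out of the archive.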
When the installation completes, you must finish integrating the domain with the Hadoop environment, which includes refreshing the cluster configuration. If you prefer to complete all integration tasks at one time, you can skip creating the services during installation and create them manually after you integrate the domain with the Hadoop environment.
Install Enterprise Data Lake on a Node with Enterprise Data Catalog
When you install Enterprise Data Lake on a node with Enterprise Data Catalog, you can choose to create the Enterprise Data Lake services. To create the services, the domain must be integrated with the Hadoop environment before you run the installer.
Before you run the installer, verify that the domain is integrated with the Hadoop environment and that the Hadoop, HDFS, and Hive connections are associated with the cluster configuration. For more information about integrating the domain with the Hadoop environment, see the Informatica Big Data Management Hadoop Integration Guide.
Install Enterprise Data Lake and Enterprise Data Catalog on an Existing Node
When you install Enterprise Data Lake and Enterprise Data Catalog on a domain node, the installer installs the service binaries. The installer does not prompt for any configuration. You must manually create the services after the installation completes.
Before you create the services, verify that the domain is integrated with the Hadoop environment and that the Hadoop, HDFS, and Hive connections are associated with the cluster configuration.