You can configure Amazon Glue as the Hive metastore with an Amazon EMR 5.29 cluster.
To enable integration with an EMR cluster with Glue, copy .jar files from the cluster to the domain, and then enable the Hive metastore setting in the hive-site.xml configuration before you refresh the cluster configuration and the Hadoop connection.
Consider the following rules and guidelines:
Glue does not support Hive transactions.
Kerberos is not supported for the Glue metastore service. You cannot use a Kerberos-enabled EMR cluster with Glue as the Hive metastore.
Copy .jar Files
Copy .jar files from the Amazon EMR cluster to the Informatica domain:
Copy the hive-exec-1.2.1-spark2-amzn-1.jar file from the
/usr/lib/spark/jars/
directory of the Glue-enabled EMR 5.29 cluster and paste it in the following location:
Enable support for Glue as the Hive metastore by setting the hive.metastore.uris property in the hive-site.xml cluster configuration file:
Choose one of the following methods to edit the hive-site.xml settings:
Retrieve an archived version of the cluster configuration .zip archive and edit the hive-site.xml file within it. You can choose this method if the settings in the .zip archive are still the same as the configuration settings on the cluster.
Copy the cluster *-site.xml configuration files to your local machine, edit the hive-site.xml file, and then create the .zip archive. Choose this method if you do not have an existing .zip archive or if the cluster configuration settings have changed since the .zip archive was created.
In either case, do not add the hive.metastore.uris property to hive-site.xml on the cluster.
Edit the hive-site.xml file to add the hive.metastore.uris property:
Property
Value
hive.metastore.uris
thrift://<Hive host name>:<port>
Create a .zip archive that contains the set of *-site.xml files, or save the edited .zip archive.
Refresh the cluster configuration on the domain.
The hive.metastore.uris property is added to the cluster configuration and the associated Hive connection.
For more information about the .zip archive and using it to refresh the cluster configuration, see the cluster configuration chapter in the
Data Engineering Administrator Guide.
More Information
For more information about Amazon Glue, see the following Amazon documentation: