Release Notes (10.4.1.1)

Release Notes (10.4.1.1)

Configure Amazon Glue as the Hive Metastore

Configure Amazon Glue as the Hive Metastore

You can configure Amazon Glue as the Hive metastore with an Amazon EMR 5.29 cluster.
To enable integration with an EMR cluster with Glue, copy .jar files from the cluster to the domain, and then enable the Hive metastore setting in the hive-site.xml configuration before you refresh the cluster configuration and the Hadoop connection.
Consider the following rules and guidelines:
  • Glue does not support Hive transactions.
  • Kerberos is not supported for the Glue metastore service. You cannot use a Kerberos-enabled EMR cluster with Glue as the Hive metastore.

Copy .jar Files

Copy .jar files from the Amazon EMR cluster to the Informatica domain:
  1. Copy the hive-exec-1.2.1-spark2-amzn-1.jar file from the
    /usr/lib/spark/jars/
    directory of the Glue-enabled EMR 5.29 cluster and paste it in the following location:
    <Informatica installation directory>/services/shared/spark/lib_spark_2.4.3_hadoop_2.7.0
    Overwrite the existing hive-exec-1.2.1-spark2-amzn-1.jar file in the directory.
  2. Copy the aws-glue-datacatalog-spark-client-1.11.0.jar file from the
    /usr/share/aws/hmclient/lib
    directory of the Glue-enabled EMR 5.29 cluster to the following location:
    <Informatica installation directory>/services/shared/spark/lib_spark_2.4.3_hadoop_2.7.0

Configure Glue as the Hive Metastore

Enable support for Glue as the Hive metastore by setting the hive.metastore.uris property in the hive-site.xml cluster configuration file:
  1. Choose one of the following methods to edit the hive-site.xml settings:
    • Retrieve an archived version of the cluster configuration .zip archive and edit the hive-site.xml file within it. You can choose this method if the settings in the .zip archive are still the same as the configuration settings on the cluster.
    • Copy the cluster *-site.xml configuration files to your local machine, edit the hive-site.xml file, and then create the .zip archive. Choose this method if you do not have an existing .zip archive or if the cluster configuration settings have changed since the .zip archive was created.
    In either case, do not add the hive.metastore.uris property to hive-site.xml on the cluster.
  2. Edit the hive-site.xml file to add the hive.metastore.uris property:
    Property
    Value
    hive.metastore.uris
    thrift://<Hive host name>:<port>
  3. Create a .zip archive that contains the set of *-site.xml files, or save the edited .zip archive.
  4. Refresh the cluster configuration on the domain.
    The hive.metastore.uris property is added to the cluster configuration and the associated Hive connection.
For more information about the .zip archive and using it to refresh the cluster configuration, see the cluster configuration chapter in the
Data Engineering Administrator Guide.

More Information

For more information about Amazon Glue, see the following Amazon documentation:
For information about Informatica support for Amazon Glue, see the Product Availability Matrix at https://network.informatica.com/community/informatica-network/product-availability-matrices.

0 COMMENTS

We’d like to hear from you!