Connections


Prerequisites

Before you create an Open Table connection, complete the prerequisites.

Using AWS Glue Catalog and Amazon S3 Storage to interact with Apache Iceberg or Delta Lake tables

If you use an AWS Glue Catalog and Amazon S3 Storage to interact with Apache Iceberg or Delta Lake tables, you need to have access to the following AWS services that manage the tables on AWS:

AWS Glue Catalog: AWS Glue Catalog manages the metadata associated with the Apache Iceberg or Delta Lake tables.

Amazon S3 Storage: Amazon S3 stores the Apache Iceberg or Delta Lake tables containing actual records in columnar format, organized in partitioned directories.

Amazon Athena: Amazon Athena uses the AWS Glue Data Catalog to store metadata, such as table and column names, for your data stored in Amazon S3. The Open Table connector uses the Amazon Athena JDBC driver to connect to the AWS Glue Catalog and access the metadata of Apache Iceberg or Delta Lake tables.

You need to create separate IAM policies to access these services.
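
As a rough illustration of what separate policies could look like, the following Python sketch builds one IAM policy document per service. The bucket name, the make_policy helper, and the specific action lists are assumptions chosen for illustration; they are not the connector's documented minimum permissions, so verify the actions against what your environment requires.

```python
import json


def make_policy(actions, resources):
    """Build an IAM policy document with a single Allow statement."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": list(actions),
                "Resource": list(resources),
            }
        ],
    }


# Hypothetical bucket name; replace it with the bucket that holds your tables.
BUCKET = "example-open-table-bucket"

# The action lists below are illustrative assumptions, not a documented minimum.
glue_policy = make_policy(
    ["glue:GetDatabase", "glue:GetDatabases", "glue:GetTable", "glue:GetTables"],
    ["*"],
)
s3_policy = make_policy(
    ["s3:GetObject", "s3:PutObject", "s3:ListBucket"],
    [f"arn:aws:s3:::{BUCKET}", f"arn:aws:s3:::{BUCKET}/*"],
)
athena_policy = make_policy(
    [
        "athena:StartQueryExecution",
        "athena:GetQueryExecution",
        "athena:GetQueryResults",
    ],
    ["*"],
)

if __name__ == "__main__":
    # Print each policy as JSON so it can be reviewed before it is created in IAM.
    for name, policy in [("glue", glue_policy), ("s3", s3_policy), ("athena", athena_policy)]:
        print(name, json.dumps(policy, indent=2))
```

Keeping one document per service makes it easier to review and tighten each policy independently, for example by narrowing the "*" resources to specific catalog or workgroup ARNs.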

Using Hive Metastore catalog and Microsoft Azure Data Lake Storage Gen2 to interact with Apache Iceberg tables

If you use a Hive Metastore catalog and Microsoft Azure Data Lake Storage Gen2 to interact with Apache Iceberg tables, you need to have access to the following services that manage the tables on Microsoft Azure Data Lake Storage Gen2:

Hive Metastore catalog: Hive Metastore catalog manages the metadata associated with the Apache Iceberg tables. The metastore must use Hive version 4.0.

Microsoft Azure Data Lake Storage Gen2: Microsoft Azure Data Lake Storage Gen2 stores the Apache Iceberg tables containing actual records in columnar format, organized in partitioned directories.

Hive JDBC driver: Hive JDBC driver connects to the Hive server to access the metadata of Apache Iceberg tables.
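
To make the phrase "organized in partitioned directories" more concrete, the following Python sketch builds an ADLS Gen2 location in the abfss URI form and appends key=value partition directories to it. The container, storage account, and table path are hypothetical, and the data/key=value layout is only a common convention assumed here; Apache Iceberg tracks data files through its metadata, so an actual table may be laid out differently.

```python
from urllib.parse import quote


def abfss_uri(container, account, path):
    """Return an ADLS Gen2 URI that uses the abfss (TLS) scheme."""
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


def partition_dir(table_location, partitions):
    """Append one key=value directory per partition column, in the given order."""
    segments = [f"{key}={quote(str(value), safe='')}" for key, value in partitions]
    return "/".join([table_location.rstrip("/"), "data", *segments])


# Hypothetical container, storage account, and table path.
table_location = abfss_uri("lakehouse", "examplestorage", "warehouse/sales.db/orders")

if __name__ == "__main__":
    print(partition_dir(table_location, [("order_date", "2024-01-15"), ("region", "EU")]))
```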

Using Hive Metastore catalog and Amazon S3 storage to interact with Apache Iceberg tables

If you use a Hive Metastore catalog and Amazon S3 storage to interact with Apache Iceberg tables, you need to have access to the following services that manage the tables on Amazon S3 storage:

Hive Metastore catalog: Hive Metastore catalog manages the metadata associated with the Apache Iceberg tables. The metastore must use Hive version 4.0.

Amazon S3 storage: Amazon S3 stores the Apache Iceberg tables containing actual records in columnar format, organized in partitioned directories.

Hive JDBC driver: Hive JDBC driver connects to the Hive server to access the metadata of Apache Iceberg tables.
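
For orientation, the Hive JDBC driver connects to HiveServer2 through a URL of the form jdbc:hive2://host:port/database. The Python sketch below only assembles such a URL; the host name is hypothetical, port 10000 is the usual HiveServer2 default rather than a value stated in this guide, and the hive_jdbc_url helper is introduced here purely for illustration.

```python
def hive_jdbc_url(host, port=10000, database="default", session_params=None):
    """Assemble a HiveServer2 JDBC URL.

    session_params is an optional dict of session variables that are
    appended as ;key=value pairs after the database name.
    """
    url = f"jdbc:hive2://{host}:{port}/{database}"
    if session_params:
        url += "".join(f";{key}={value}" for key, value in session_params.items())
    return url


if __name__ == "__main__":
    # Hypothetical host; replace it with your Hive server.
    print(hive_jdbc_url("hive.example.com"))
    print(hive_jdbc_url("hive.example.com", database="sales", session_params={"ssl": "true"}))
```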

Using REST catalog and Amazon S3 to interact with Apache Iceberg tables

If you use a REST catalog, such as the Polaris catalog, and Amazon S3 storage to interact with Apache Iceberg tables, you need to have access to the following services that manage the tables on Amazon S3 storage:

REST catalog: REST catalog manages the metadata associated with the Apache Iceberg tables.

Amazon S3 storage: Amazon S3 stores the Apache Iceberg tables containing actual records in columnar format, organized in partitioned directories.
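
As a sketch of how a client addresses table metadata in a REST catalog, the Python function below builds the load-table route defined by the Apache Iceberg REST catalog specification: /v1/{prefix}/namespaces/{namespace}/tables/{table}, with multi-level namespaces joined by the unit separator character (encoded as %1F). The base URI, prefix, namespace, and table name are hypothetical, and this guide does not state which routes the connector itself calls.

```python
from urllib.parse import quote


def load_table_url(base_uri, namespace, table, prefix=None):
    """Build the Iceberg REST catalog URL used to load a table's metadata.

    namespace is a list of namespace levels, for example ["sales", "emea"].
    """
    # Multi-level namespaces are joined with the 0x1F unit separator, then URL-encoded.
    encoded_namespace = quote("\x1f".join(namespace), safe="")
    parts = [base_uri.rstrip("/"), "v1"]
    if prefix:
        parts.append(quote(prefix, safe=""))
    parts += ["namespaces", encoded_namespace, "tables", quote(table, safe="")]
    return "/".join(parts)


if __name__ == "__main__":
    # Hypothetical catalog endpoint and identifiers.
    print(load_table_url("https://catalog.example.com/api/catalog", ["sales", "emea"], "orders", prefix="prod"))
```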

