Before you create an Open Table connection, complete the prerequisites.
Using AWS Glue Catalog and Amazon S3 Storage to interact with Apache Iceberg or
Delta Lake tables
If you use an AWS Glue Catalog and Amazon S3
Storage to interact with Apache Iceberg or Delta Lake tables, you need to have
access to the following AWS services that manage the tables on AWS:
AWS Glue Catalog: AWS Glue Catalog
manages the metadata associated with the Apache Iceberg or Delta Lake
tables.
Amazon S3 Storage: Amazon S3 stores
the Apache Iceberg or Delta Lake tables containing actual records in
columnar format, organized in partitioned directories.
Amazon Athena: Amazon Athena uses the AWS Glue
Data Catalog to store metadata such as table and column names for your data
stored in Amazon S3. Open Table Connector uses the Amazon Athena JDBC driver
to connect to the AWS Glue Catalog to access Apache Iceberg or Delta Lake
tables metadata.
You need to create separate policies to access these services.
Using Hive Metastore catalog and Microsoft
Azure Data Lake Storage Gen2 to interact with Apache Iceberg tables
If you use a Hive Metastore catalog and Microsoft Azure Data Lake Storage Gen2 to interact
with Apache Iceberg tables, you need to have access to the following services that
manage the tables on Microsoft Azure Data Lake Storage Gen2:
Hive Metastore catalog: Hive Metastore
catalog manages the metadata associated with the Apache Iceberg tables. The
metastore must use Hive version 4.0.
Microsoft Azure Data Lake
Storage Gen2: Microsoft Azure Data Lake Storage Gen2 stores the Apache
Iceberg tables containing actual records in columnar format, organized in
partitioned directories.
Hive JDBC driver: Hive JDBC driver connects
to the Hive server to access the metadata of Apache Iceberg tables.
Using Hive Metastore catalog and Amazon S3 storage to interact with Apache
Iceberg tables
If you use a Hive Metastore catalog and Amazon S3 storage to interact with Apache Iceberg
tables, you need to have access to the following services that manage the tables on
Amazon S3 storage:
Hive Metastore catalog: Hive Metastore
catalog manages the metadata associated with the Apache Iceberg tables. The
metastore must use Hive version 4.0.
Amazon S3 storage: Amazon S3
stores the Apache Iceberg tables containing actual records in columnar
format, organized in partitioned directories.
Hive JDBC driver: Hive JDBC driver connects
to the Hive server to access the metadata of Apache Iceberg
tables.
Using REST catalog and Amazon S3 to interact with Apache Iceberg tables
If you use a REST catalog such as Polaris catalog and Amazon S3 storage to interact with
Apache Iceberg tables, you need to have access to the following services that manage
the tables on Amazon S3 storage:
REST catalog: REST catalog
manages the metadata associated with the Apache Iceberg tables.
Amazon S3 storage: Amazon S3
stores the Apache Iceberg tables containing actual records in columnar format,
organized in partitioned directories.