Data Profiling

Amazon Athena

To access an Amazon Athena source object, you must create an Amazon Athena connection to the source object.
Configure the following Amazon Athena connection properties to create and run a data profiling task on an Amazon Athena source object:
Property
Description
Connection Name
Name of the connection.
Each connection name must be unique within the organization. Connection names can contain alphanumeric characters, spaces, and the following special characters: _ . + -,
Maximum length is 255 characters.
Description
Description of the connection. Maximum length is 4000 characters.
Type
The Amazon Athena connection type.
Runtime Environment
Name of the runtime environment where you want to run the tasks.
Authentication Type
The authentication mechanism to connect to Amazon Athena. Select Permanent IAM Credentials or EC2 instance profile.
Permanent IAM Credentials is the default authentication mechanism. It requires an access key and a secret key to connect to Amazon Athena.
Use the EC2 instance profile when the Secure Agent is installed on an Amazon Elastic Compute Cloud (EC2) system. This way, you can configure AWS Identity and Access Management (IAM) authentication to connect to Amazon Athena.
For more information about authentication, see Prepare for authentication.
Access Key
Optional. The access key to connect to Amazon Athena. Required when the authentication type is Permanent IAM Credentials.
Secret Key
Optional. The secret key to connect to Amazon Athena. Required when the authentication type is Permanent IAM Credentials.
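The relationship between the authentication type and the two key properties can be sketched as a small pre-check. This is an illustrative Python sketch only, not part of the product: the function name `missing_auth_properties` is hypothetical, and it assumes that no keys are needed when EC2 instance profile is selected.

```python
# Authentication type labels as they appear in the connection properties.
PERMANENT_IAM = "Permanent IAM Credentials"
EC2_INSTANCE_PROFILE = "EC2 instance profile"


def missing_auth_properties(auth_type, access_key=None, secret_key=None):
    """Return the names of key properties that are still missing.

    Hypothetical helper for illustration. Assumes that the access key and
    secret key are only needed for Permanent IAM Credentials.
    """
    if auth_type not in (PERMANENT_IAM, EC2_INSTANCE_PROFILE):
        raise ValueError(f"Unknown authentication type: {auth_type!r}")

    # Assumption: the instance profile supplies credentials, so no keys are checked.
    if auth_type == EC2_INSTANCE_PROFILE:
        return []

    missing = []
    if not access_key:
        missing.append("Access Key")
    if not secret_key:
        missing.append("Secret Key")
    return missing


# Example usage with placeholder values.
print(missing_auth_properties(PERMANENT_IAM, access_key="EXAMPLEKEY"))
print(missing_auth_properties(EC2_INSTANCE_PROFILE))
```

An empty list means that the key properties are complete for the selected authentication type.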
JDBC URL
The URL of the Amazon Athena connection.
Enter the JDBC URL in the following format:
jdbc:awsathena://AwsRegion=<region_name>;S3OutputLocation=<S3_Output_Location>;
You can use pagination to fetch the Amazon Athena query results. Set the property UseResultsetStreaming=0 to use pagination.
Enter the property in the following format:
jdbc:awsathena://AwsRegion=<region_name>;S3OutputLocation=<S3_Output_Location>;UseResultsetStreaming=0;
You can also use streaming to fetch the Amazon Athena query results faster. Streaming is enabled by default. When you use streaming, ensure that port 444 is open.
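The two JDBC URL formats above differ only in the UseResultsetStreaming property. The following Python sketch assembles either form from its parts. The function name `build_athena_jdbc_url` and the example region and bucket values are hypothetical placeholders.

```python
def build_athena_jdbc_url(region, s3_output_location, use_streaming=True):
    """Assemble an Amazon Athena JDBC URL in the documented format.

    Hypothetical helper for illustration; it only concatenates the
    properties shown in the JDBC URL description.
    """
    if not region or not s3_output_location:
        raise ValueError("region and s3_output_location are required")

    url = (
        f"jdbc:awsathena://AwsRegion={region};"
        f"S3OutputLocation={s3_output_location};"
    )

    # Streaming is the default, so the property is added only to switch
    # to pagination.
    if not use_streaming:
        url += "UseResultsetStreaming=0;"
    return url


# Example usage with placeholder values.
print(build_athena_jdbc_url("us-east-1", "s3://example-bucket/athena-results/"))
print(build_athena_jdbc_url("us-east-1", "s3://example-bucket/athena-results/",
                            use_streaming=False))
```

Leaving the property out keeps the default streaming behavior, which is why the sketch appends it only for pagination.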
Customer Master Key ID
Optional. Specify the customer master key ID generated by AWS Key Management Service (AWS KMS) or the Amazon Resource Name (ARN) of your custom key for cross-account access.
You must generate the customer master key ID for the same region where your Amazon S3 bucket resides. You can specify either the customer-generated customer master key ID or the default customer master key ID.
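When you specify the key as an ARN, the region requirement above can be checked before you save the connection. The following Python sketch relies on the standard AWS KMS ARN layout (arn:partition:kms:region:account-id:key/key-id). The function names and the sample ARN are hypothetical, and the check does not apply to a bare key ID, which carries no region.

```python
def kms_key_region(key_arn):
    """Extract the region from an AWS KMS key ARN.

    Hypothetical helper for illustration. Expects the layout
    arn:partition:kms:region:account-id:key/key-id.
    """
    parts = key_arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn" or parts[2] != "kms" or not parts[3]:
        raise ValueError(f"Not a KMS key ARN: {key_arn!r}")
    return parts[3]


def key_matches_bucket_region(key_arn, bucket_region):
    """Return True when the key ARN is in the same region as the S3 bucket."""
    return kms_key_region(key_arn) == bucket_region


# Example usage with a placeholder ARN and bucket region.
sample_arn = "arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab"
print(key_matches_bucket_region(sample_arn, "us-west-2"))
```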
For more information about the Amazon Athena connection properties, see the help for the Amazon Athena connector.
