PowerExchange for Amazon Redshift User Guide for PowerCenter

Amazon Redshift Source Session Configuration

You can configure a session to read data from Amazon Redshift. Define the properties for each source in the session.
The following table describes the source session properties:
Property
Description
Pre-SQL
The SQL commands to run before the session reads data from Amazon Redshift. You can also use the UNLOAD or COPY command. The command that you specify is processed as plain text.
Post-SQL
The SQL commands to run after the session reads data from Amazon Redshift. You can also use the UNLOAD or COPY command. The command that you specify is processed as plain text.
Number of Sorted Ports
Number of columns used when sorting rows queried from the source. The PowerCenter Integration Service adds an ORDER BY clause to the default query when it reads source rows. The ORDER BY clause includes the number of ports specified, starting from the top of the transformation. When you specify the number of sorted ports, the database sort order must match the session sort order.
Default is 0.
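As an illustration of how the number of sorted ports shapes the query, the following minimal sketch builds a SELECT statement and appends an ORDER BY clause that covers the first N ports. The function name `build_default_query`, the table name, and the port names are hypothetical, and the exact query text that the PowerCenter Integration Service generates might differ.

```python
def build_default_query(table, ports, sorted_ports=0):
    """Return a SELECT over `ports`, ordered by the first `sorted_ports` ports.

    Hypothetical helper for illustration only; not the Integration Service's code.
    """
    if not 0 <= sorted_ports <= len(ports):
        raise ValueError("sorted_ports must be between 0 and the number of ports")
    query = "SELECT {} FROM {}".format(", ".join(ports), table)
    if sorted_ports:
        # Ports are taken in order, starting from the top of the transformation.
        query += " ORDER BY " + ", ".join(ports[:sorted_ports])
    return query


# Example ports as they might appear, top to bottom, in the transformation.
example_ports = ["order_id", "customer_id", "amount"]
print(build_default_query("orders", example_ports, sorted_ports=2))
```

With the default value of 0, the sketch adds no ORDER BY clause.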
Select Distinct
Selects unique values. The PowerCenter Integration Service includes a SELECT DISTINCT statement if you choose this option. Amazon Redshift ignores trailing spaces when it compares values, so values that differ only in trailing spaces count as duplicates. Therefore, the PowerCenter Integration Service might extract fewer rows than expected.
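The effect of trailing spaces on the row count can be simulated outside the database. The sketch below is an assumption-based model, not Amazon Redshift code: it treats values that differ only in trailing spaces as equal and compares the result with an exact-match distinct. The function name and sample values are hypothetical.

```python
def distinct_ignoring_trailing_spaces(values):
    """Keep the first occurrence of each value, comparing without trailing spaces."""
    seen = set()
    result = []
    for value in values:
        key = value.rstrip(" ")  # trailing spaces do not take part in the comparison
        if key not in seen:
            seen.add(key)
            result.append(value)
    return result


sample = ["abc", "abc  ", "abd "]
exact_count = len(set(sample))                               # exact string matching
redshift_like = distinct_ignoring_trailing_spaces(sample)    # trailing spaces ignored
print(exact_count, len(redshift_like))
```

Here an exact comparison sees three distinct values, while the trailing-space-insensitive comparison keeps two.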
Source Table Name
Overrides the default source table name.
S3 Bucket Name
Amazon S3 bucket name for the Amazon Redshift source data.
Use an S3 bucket in the same region as your Amazon Redshift cluster.
Enable Compression
Compresses staged files before writing the files to Amazon Redshift.
Session performance improves when the PowerCenter Integration Service compresses the staged files.
Default is selected.
Staging Directory Location
Amazon Redshift staging directory.
Specify a directory on the machine that hosts the PowerCenter Integration Service.
UnloadOptions Property File
Path to the property file.
Enables you to add options to the unload command to write data from an Amazon Redshift object to an S3 bucket.
You can add the following options:
  • DELIMITER
  • PARALLEL
  • ESCAPE
  • AWS_IAM_ROLE
Either specify the path of the property file that contains the unload options or specify the unload options directly in the UnloadOptions Property File field. Specify a directory on the machine that hosts the PowerCenter Integration Service.
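To show where these options end up, the following sketch assembles an Amazon Redshift UNLOAD statement from a dictionary of options. It is a minimal illustration under stated assumptions: the function name, bucket path, and role ARN are hypothetical, the mapping of AWS_IAM_ROLE to the IAM_ROLE clause is assumed, and the statement that the PowerCenter Integration Service actually issues might differ.

```python
def build_unload_statement(query, s3_path, options):
    """Assemble an UNLOAD statement from a dict of unload options.

    Hypothetical helper; option names follow the list above.
    """
    # Single quotes inside the query are doubled because UNLOAD takes a quoted query.
    escaped_query = query.replace("'", "''")
    parts = ["UNLOAD ('{}')".format(escaped_query), "TO '{}'".format(s3_path)]
    if "AWS_IAM_ROLE" in options:
        parts.append("IAM_ROLE '{}'".format(options["AWS_IAM_ROLE"]))
    if "DELIMITER" in options:
        parts.append("DELIMITER '{}'".format(options["DELIMITER"]))
    if options.get("ESCAPE"):
        parts.append("ESCAPE")
    if "PARALLEL" in options:
        parts.append("PARALLEL {}".format(options["PARALLEL"]))
    return " ".join(parts)


example_options = {
    "DELIMITER": "|",
    "ESCAPE": True,
    "PARALLEL": "OFF",
    # Placeholder ARN for illustration only.
    "AWS_IAM_ROLE": "arn:aws:iam::123456789012:role/ExampleUnloadRole",
}
statement = build_unload_statement(
    "SELECT * FROM sales WHERE region = 'EU'",
    "s3://example-bucket/unload/sales_",
    example_options,
)
print(statement)
```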
Turn on S3 Client Side Encryption
Indicates that the PowerCenter Integration Service encrypts data before writing the data to Amazon S3 by using a private encryption key.
Encryption Type
Select the source encryption type. You can select from the following encryption types:
  • SSE-S3
  • SSE-KMS
Default is SSE-S3.
Enable Downloading S3 Files in Multiple Parts
Downloads large Amazon S3 objects in multiple parts.
When the file size of an Amazon S3 object is greater than 8 MB, you can choose to download the object in multiple parts in parallel.
Part Size
Specifies the size of each part when an Amazon S3 object is downloaded in multiple parts.
Default is 5 MB.
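The arithmetic behind a multipart download can be sketched as follows. The example splits an object into byte ranges of the configured part size, assuming that 1 MB equals 1024 * 1024 bytes. The function name `part_ranges` is hypothetical, and the sketch does not show how the PowerCenter Integration Service schedules the parallel downloads.

```python
MB = 1024 * 1024  # assumption: 1 MB is 1024 * 1024 bytes


def part_ranges(object_size, part_size=5 * MB):
    """Return inclusive (start, end) byte ranges that cover the whole object."""
    if part_size <= 0:
        raise ValueError("part_size must be positive")
    return [
        (start, min(start + part_size, object_size) - 1)
        for start in range(0, object_size, part_size)
    ]


# A 12 MB object with the default 5 MB part size yields two full parts and one 2 MB part.
ranges = part_ranges(12 * MB)
for start, end in ranges:
    print(start, end, end - start + 1)
```

Each range could then be requested independently, which is what allows the parts to be fetched in parallel.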
Infa Advanced Filter
SQL filter command to divide the source database into multiple segments.
SQL Query
Overrides the default query. Enclose column names in double quotes. The SQL query is case sensitive. Specify an SQL statement supported by the Amazon Redshift database.
If you add constraints to the table structure in a mapping and those constraints are not present in the Amazon Redshift source table, the session fails with an error message.
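For the SQL Query property, the double-quoting rule for column names can be illustrated with a small helper that composes an override query. This is a sketch under assumptions: the helper names, the table name, and the column names are hypothetical, and the query must still match the case of the identifiers as they are defined in Amazon Redshift.

```python
def quote_identifier(name):
    """Enclose an identifier in double quotes, doubling any embedded double quote."""
    return '"' + name.replace('"', '""') + '"'


def build_override_query(table, columns):
    """Compose a SELECT with every column name enclosed in double quotes."""
    column_list = ", ".join(quote_identifier(c) for c in columns)
    return "SELECT {} FROM {}".format(column_list, table)


override = build_override_query("public.orders", ["order_id", "Order Date"])
print(override)
```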
