User Guide

DataStreamer

DataStreamer is an add-on component that you can purchase for Amazon Redshift, Greenplum, Netezza, Teradata, and Vertica targets. For Greenplum, Netezza, and Teradata targets, DataStreamer is an optional component that loads data faster. For Amazon Redshift targets, DataStreamer is a required component that is always enabled and used transparently.
With DataStreamer, you must use the direct path unload method. Depending on the target type, DataStreamer streams the unloaded Oracle data to the target in one of the following ways:
  • For Amazon Redshift targets, DataStreamer sends the unloaded data to Amazon Simple Storage Service (Amazon S3). After the source data is in Amazon S3 storage, Fast Clone issues a COPY command that copies the data to the Amazon Redshift target tables.
    If you plan to run unload jobs on a Windows system, you must install the PostgreSQL ODBC driver on that system. If you plan to run unload jobs on a Linux or UNIX system, use the DataDirect ODBC driver for PostgreSQL that Fast Clone provides.
  • For Greenplum targets, DataStreamer sends the unloaded data directly to the Greenplum parallel file distribution server (gpfdist) for loading to the target.
  • For Netezza targets, DataStreamer writes the unloaded data to the named pipes that represent the Netezza external tables. The Netezza ODBC driver reads the data from these pipes and loads the data to the Netezza target tables.
    To use the Netezza DataStreamer, you must install the Netezza ODBC driver on the system where you plan to run unload jobs.
  • For Teradata targets, DataStreamer sends the unloaded data directly to the Teradata Parallel Data Pump, FastLoad, or MultiLoad utility for loading to the target.
    To use the Teradata DataStreamer, you must install the Teradata Parallel Transporter (TPT) libraries on the system where you plan to run unload jobs.
  • For Vertica targets, DataStreamer uses the COPY command on the server side or the LCOPY command on the client side to send the unloaded data directly to Vertica targets.
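The per-target behavior in the list above can be summarized as a lookup table. The following Python sketch is illustrative only: the names DELIVERY and prerequisites are hypothetical and are not part of Fast Clone, and the values only restate what this section says.

```python
from typing import Dict, List

# Hypothetical summary table; the keys and names are not Fast Clone identifiers.
# Each value restates how DataStreamer delivers unloaded data to that target.
DELIVERY: Dict[str, str] = {
    "redshift": "Amazon S3, then a copy command into the Redshift tables",
    "greenplum": "gpfdist (Greenplum parallel file distribution server)",
    "netezza": "named pipes that represent Netezza external tables",
    "teradata": "Teradata Parallel Data Pump, FastLoad, or MultiLoad utility",
    "vertica": "COPY (server side) or LCOPY (client side)",
}


def prerequisites(target: str, os_family: str) -> List[str]:
    """Return the software this section says must be on the unload system."""
    target = target.lower()
    os_family = os_family.lower()
    if target not in DELIVERY:
        raise ValueError(f"unsupported DataStreamer target: {target}")
    if target == "redshift":
        # The required ODBC driver depends on where the unload job runs.
        if os_family == "windows":
            return ["PostgreSQL ODBC driver"]
        return ["DataDirect ODBC driver for PostgreSQL (provided by Fast Clone)"]
    if target == "netezza":
        return ["Netezza ODBC driver"]
    if target == "teradata":
        return ["TPT libraries"]
    # This section lists no prerequisite for Greenplum or Vertica.
    return []


if __name__ == "__main__":
    for name, path in DELIVERY.items():
        print(f"{name}: {path} | needs: {prerequisites(name, 'linux')}")
```

A table like this can serve as a pre-flight checklist before you schedule unload jobs on a new system. Treat it as a reading aid, not as a substitute for the installation requirements in the product documentation.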
If you use DataStreamer, set the suppress_trailing_nullcols parameter value to false. You can set this parameter in the configuration file or in the Fast Clone Console on the Runtime Settings tab > Format Settings view. On the Format Settings view, clear the Suppress trailing null columns option. For more information, see Runtime Parameters.
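As a rough illustration, a pre-run check for this setting might look like the following Python sketch. It assumes that the configuration file holds one name=value pair per line and that lines starting with # are comments; that syntax is an assumption made for the example and is not confirmed by this section, so verify it against the Fast Clone Configuration File Parameters chapter. The helper names parse_params and trailing_nullcols_ok are hypothetical.

```python
from typing import Dict


def parse_params(text: str) -> Dict[str, str]:
    """Parse 'name=value' lines (assumed syntax); skip blanks and '#' comments."""
    params: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, value = line.split("=", 1)
        # Normalize case and whitespace so 'FALSE' and ' false ' compare equal.
        params[name.strip().lower()] = value.strip().lower()
    return params


def trailing_nullcols_ok(text: str) -> bool:
    """Return True only if suppress_trailing_nullcols is explicitly false.

    The default value is not stated in this section, so a missing
    parameter is reported as not OK rather than guessed.
    """
    return parse_params(text).get("suppress_trailing_nullcols") == "false"


if __name__ == "__main__":
    sample = "# hypothetical configuration fragment\nsuppress_trailing_nullcols=false\n"
    print(trailing_nullcols_ok(sample))
```

The check deliberately requires an explicit setting, because relying on an unknown default could hide a misconfiguration.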
