Table of Contents

Search

  1. Preface
  2. Introduction to PowerExchange for Teradata Parallel Transporter API
  3. PowerExchange for Teradata Parallel Transporter API Configuration
  4. PowerExchange for Teradata Parallel Transporter Connections
  5. PowerExchange for Teradata Parallel Transporter API Data Objects
  6. PowerExchange for Teradata Parallel Transporter API Mappings
  7. Teradata Parallel Transporter API Run-time Processing
  8. Data Type Reference

PowerExchange for Teradata Parallel Transporter API User Guide

PowerExchange for Teradata Parallel Transporter API User Guide

Teradata Connector for Hadoop Configuration for the Hive Engine

Teradata Connector for Hadoop Configuration for the Hive Engine

When you run a Teradata mapping on the Hive engine, you can use the Teradata Connector for Hadoop (TDCH) Command Line Edition to increase performance. TDCH is a set of API and tools that Teradata Corporation provides for parallel processing of data between Teradata databases and the Hadoop ecosystem of products. You can download TDCH from the Teradata Developer Exchange website.
When you run a Teradata mapping on the Hive engine, by default, the Data Integration Service pushes the mapping to a Hadoop cluster and processes the mapping with one mapper task. You can enable TDCH to run a Teradata mapping on the Hive engine. TDCH uses multiple mapper tasks to read and write the data, which significantly increases the performance.
TDCH uses a configuration file named
InfaTDCHConfig.txt
to read and write data. You can define configuration options in the
InfaTDCHConfig.txt
file to specify how you want TDCH to read and write the data.
To run a Teradata mapping on the Hive Engine with TDCH, perform the following tasks:
  1. Verify the TDCH prerequisites.
  2. Install and configure TDCH.
  3. Configure the
    InfaTDCHConfig.txt
    file.
  4. Configure the EnableTdch custom property.


Updated October 05, 2018