When you run a Teradata mapping on the Hive engine, you can use the Teradata Connector for Hadoop (TDCH) Command Line Edition to increase performance. TDCH is a set of API and tools that Teradata Corporation provides for parallel processing of data between Teradata databases and the Hadoop ecosystem of products. You can download TDCH from the Teradata Developer Exchange website.
When you run a Teradata mapping on the Hive engine, by default, the Data Integration Service pushes the mapping to a Hadoop cluster and processes the mapping with one mapper task. You can enable TDCH to run a Teradata mapping on the Hive engine. TDCH uses multiple mapper tasks to read and write the data, which significantly increases the performance.