PowerExchange for Hive User Guide

Advanced Properties

The advanced properties include the write properties that are used to write data to the target. You can specify properties such as SQL commands.
The file path in an SQL command depends on the type of the run-time environment. If you run the mapping in the native environment, the file path must be relative to the host that you specified in the Hive connection. If you run the mapping in the Hadoop environment, the file path must be relative to the machine that hosts the Data Integration Service for the Hive environment type.
The following table describes the advanced properties that you configure for Hive targets:
Property
Description
Tracing level
Controls the amount of detail in the mapping log file.
Target Schema Strategy
Type of target schema strategy for the target table.
You can select one of the following target schema strategies:
  • RETAIN - Retain existing target schema
  • CREATE - Create or replace table at run time
  • APPLYNEWCOLUMNS - Alter table and apply new columns only
  • APPLYNEWSCHEMA - Alter table and apply new schema
  • FAIL - Fail mapping if target schema is different
  • Assign Parameter
DDL query for create or replace
The DDL query based on which the Data Integration Service creates or replaces the target table.
This option is applicable when you select the CREATE - Create or replace table at run time target schema strategy.
By default, a Hive table is created with ORC storage format on the Hortonworks HDP 3.1 distribution.
Truncate target table
Truncates the target before loading data.
Default is enabled.
Truncate target partition
Truncates an internal or external partitioned Hive target before loading data. You must select the Truncate target table option before you select this option.
This option is applicable when you run the mapping in the Hadoop environment.
Default is disabled.
PreSQL
SQL command that the Data Integration Service runs against the target database before it reads the source. Only the Spark engine can run PreSQL commands against Hive targets.
The Developer tool does not validate the SQL.
PostSQL
SQL command that the Data Integration Service runs against the target database after it writes to the target. Only the Spark engine can run PostSQL commands against Hive targets.
The Developer tool does not validate the SQL.
Maintain row order
Maintains row order while writing data to the target. Select this option if the Data Integration Service should not perform any optimization that can change the row order.
When the Data Integration Service performs optimizations, it might lose the row order that was established earlier in the mapping. When you configure a target to maintain row order, the Data Integration Service does not perform optimizations for the target.
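
The target schema strategies listed in the table above can be easier to compare side by side. The following Python sketch is a toy model only, not how the Data Integration Service is implemented. It assumes a schema is just an ordered list of column names, and it infers the effect of each strategy from the option label alone. The names `Strategy` and `resolve_schema` are hypothetical, and the Assign Parameter option is left out because it only defers the choice to a mapping parameter.

```python
from enum import Enum


class Strategy(Enum):
    """Target schema strategies, named after the options in the table."""
    RETAIN = "Retain existing target schema"
    CREATE = "Create or replace table at run time"
    APPLYNEWCOLUMNS = "Alter table and apply new columns only"
    APPLYNEWSCHEMA = "Alter table and apply new schema"
    FAIL = "Fail mapping if target schema is different"


def resolve_schema(strategy, existing, incoming):
    """Return the column list the target would end up with.

    existing: columns currently in the target table
    incoming: columns defined by the mapping
    Assumption: only column names are compared; types are ignored.
    """
    existing, incoming = list(existing), list(incoming)
    if strategy is Strategy.RETAIN:
        # Keep the table as it is, whatever the mapping defines.
        return existing
    if strategy in (Strategy.CREATE, Strategy.APPLYNEWSCHEMA):
        # Both end with the mapping's schema. In this model they differ
        # only in how they get there (replace the table vs. alter it).
        return incoming
    if strategy is Strategy.APPLYNEWCOLUMNS:
        # Keep every existing column and append the ones that are new.
        return existing + [c for c in incoming if c not in existing]
    if strategy is Strategy.FAIL:
        if existing != incoming:
            raise ValueError("target schema differs from mapping schema")
        return existing
    raise ValueError(f"unsupported strategy: {strategy!r}")


if __name__ == "__main__":
    current = ["id", "name"]
    mapping = ["id", "email"]
    for s in (Strategy.RETAIN, Strategy.CREATE, Strategy.APPLYNEWCOLUMNS):
        print(s.name, resolve_schema(s, current, mapping))
```

In this model, CREATE and APPLYNEWSCHEMA produce the same column list. The sketch does not capture what happens to existing data in each case.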
