Rules and guidelines for adding partitioning columns
Consider the following general rules and guidelines when you add partitioning columns to a target that you create at runtime:
You cannot add partitioning columns to an existing table.
Parallel processing is not applicable for Hive Connector.
If you specify a value greater than 1 for the number of buckets to include in the target, but you do not select the bucket columns in the Target Fields tab, Data Integration does not create the buckets in the target Hive table.
You cannot create more than two partitions for a Hive target on the Cloudera CDP version 7.1.1 cluster.
If you create a Hive target on the Cloudera EMR cluster and the data is of the RC file format, you cannot insert null values into a column of the Binary data type.
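The bucket rule above can be checked before you run a mapping. The following is a minimal sketch in Python that models only the documented behavior. The function name `bucket_warnings` and its parameters are hypothetical and are not part of Data Integration or Hive Connector.

```python
def bucket_warnings(num_buckets: int, bucket_columns: list[str]) -> list[str]:
    """Return warnings for a planned runtime target (hypothetical helper).

    Models one documented rule: a bucket count greater than 1 has no
    effect unless bucket columns are also selected.
    """
    warnings = []
    if num_buckets > 1 and not bucket_columns:
        # Documented outcome: the buckets are not created in the Hive table.
        warnings.append(
            f"{num_buckets} buckets requested but no bucket columns selected; "
            "no buckets will be created in the target Hive table."
        )
    return warnings


# Example: a bucket count is set, but no bucket columns are selected.
for message in bucket_warnings(4, []):
    print(message)
```

A call such as `bucket_warnings(4, ["customer_id"])` returns an empty list, because both the bucket count and the bucket columns are set.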
Mapping
Consider the following rules and guidelines for mappings:
When you select columns of a data type other than String as partition columns for a target that you want to create at runtime, the data type is written as String in the target. You cannot select a field of the Binary data type for partitioning when you create a new target at runtime.
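The two mapping rules above can be sketched as a small Python check. This is an illustration under stated assumptions, not how Data Integration works internally. The function name `plan_partition_columns`, the field names, and the lowercase type labels are all hypothetical.

```python
def plan_partition_columns(
    fields: dict[str, str], partition_names: list[str]
) -> dict[str, str]:
    """Return the data type each partition column gets in the new target.

    Hypothetical helper that models two documented rules:
    - Binary fields cannot be selected for partitioning.
    - Partition columns of any other data type are written as String.
    """
    planned = {}
    for name in partition_names:
        source_type = fields[name].lower()
        if source_type == "binary":
            raise ValueError(
                f"Field '{name}' is Binary and cannot be a partition column."
            )
        # Whatever the source type is, the target stores it as String.
        planned[name] = "string"
    return planned


# Hypothetical source fields and their data types.
source_fields = {"year": "int", "region": "string", "payload": "binary"}
print(plan_partition_columns(source_fields, ["year", "region"]))
```

In this sketch, selecting `payload` as a partition column raises a `ValueError`, which mirrors the restriction on Binary fields.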
Mappings in advanced mode
You cannot select a field of the Binary data type for partitioning when you write data to the target.