Hive Connector

Rules and guidelines for adding partitioning columns

Consider the following general rules and guidelines when you add partitioning columns to a target that you create at runtime:
  • You cannot add partitioning columns to an existing table.
  • Parallel processing is not applicable for Hive Connector.
  • If you specify a value greater than 1 for the number of buckets to include in the target, but you do not select the bucket columns in the Target Fields tab, Data Integration does not create the buckets in the target Hive table.
  • You cannot create more than two partitions for a Hive target on the Cloudera CDP version 7.1.1 cluster.
  • If you create a Hive target on the Amazon EMR cluster and the data is in the RC file format, you cannot insert null values into a column of the Binary data type.
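
The general rules above can be sketched as a small pre-flight check. This is only an illustration in Python, not part of Data Integration or Hive Connector: the TargetSpec class, its field names, the check_partitioning_rules function, and the "CDP 7.1.1" cluster label are all hypothetical. The sketch also assumes that the two-partition limit on Cloudera CDP 7.1.1 counts partitioning columns.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TargetSpec:
    """Hypothetical description of a Hive target (not a Data Integration API)."""
    create_at_runtime: bool
    partition_columns: List[str] = field(default_factory=list)
    bucket_count: int = 0
    bucket_columns: List[str] = field(default_factory=list)
    cluster: str = ""  # assumed label format, for example "CDP 7.1.1"


def check_partitioning_rules(spec: TargetSpec) -> List[str]:
    """Return one message per general rule that the specification runs into."""
    issues: List[str] = []

    # Partitioning columns apply only to targets created at runtime.
    if spec.partition_columns and not spec.create_at_runtime:
        issues.append("Partitioning columns cannot be added to an existing table.")

    # A bucket count greater than 1 has no effect without bucket columns.
    if spec.bucket_count > 1 and not spec.bucket_columns:
        issues.append("No bucket columns selected; buckets will not be created.")

    # Assumption: the limit of two partitions is counted in partitioning columns.
    if spec.cluster == "CDP 7.1.1" and len(spec.partition_columns) > 2:
        issues.append("Cloudera CDP 7.1.1 allows at most two partitions.")

    return issues


if __name__ == "__main__":
    spec = TargetSpec(create_at_runtime=True,
                      partition_columns=["region"],
                      bucket_count=4)
    for message in check_partitioning_rules(spec):
        print(message)
```

A check like this could run before a mapping is deployed, so that a bucket count without bucket columns is caught early instead of silently producing an unbucketed table.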

Mapping

Consider the following rules and guidelines for mappings:
  • When you select columns of a data type other than String as partition columns for a target that you want to create at runtime, the data type is written as String in the target. You cannot select a field of the Binary data type for partitioning when you create a new target at runtime.
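
The effect of this rule on the target schema can be illustrated with a short Python sketch. The function name runtime_target_types and the lowercase type names are hypothetical and chosen only for this example; they are not Data Integration identifiers.

```python
from typing import Dict, List


def runtime_target_types(fields: Dict[str, str],
                         partition_columns: List[str]) -> Dict[str, str]:
    """Return the data types expected in a target created at runtime.

    Partition columns are written as String, and Binary fields are
    rejected as partition columns, as the rule above describes.
    """
    result = dict(fields)
    for name in partition_columns:
        if name not in fields:
            raise KeyError(f"Unknown field: {name}")
        if fields[name].lower() == "binary":
            raise ValueError(f"Binary field '{name}' cannot be a partition column.")
        # Any other data type is written as String in the target.
        result[name] = "string"
    return result


if __name__ == "__main__":
    source_fields = {"id": "int", "order_date": "date", "amount": "double"}
    print(runtime_target_types(source_fields, ["order_date"]))
```

In this sketch, order_date enters as a date field and comes out as a string field, while the non-partition fields keep their original types.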

Mappings in advanced mode

You cannot select a field of the Binary data type for partitioning when you write data to the target.
