Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Mappings
  4. Sources
  5. Targets
  6. Transformations
  7. Data Preview
  8. Cluster Workflows
  9. Profiles
  10. Monitoring
  11. Hierarchical Data Processing
  12. Hierarchical Data Processing Configuration
  13. Hierarchical Data Processing with Schema Changes
  14. Intelligent Structure Models
  15. Stateful Computing
  16. Appendix A: Connections
  17. Appendix B: Data Type Reference
  18. Appendix C: Function Reference

Parquet and Transformation Data Types

Parquet and Transformation Data Types

Apache Parquet data types map to transformation data types that the Data Integration Service uses to move data across platforms.
The following table compares Parquet data types and transformation data types:
Parquet
Transformation
Description
Binary
Binary
1 to 104,857,600 bytes
Binary (UTF8)
String
1 to 104,857,600 characters
Boolean
Integer
-2,147,483,648 to 2,147,483,647
Precision of 10, scale of 0
Double
Double
Precision of 15 digits
Fixed Length Byte Array
Decimal
Decimal value with declared precision and scale. Scale must be less than or equal to precision.
For transformations that support precision up to 38 digits, the precision is 1 to 38 digits, and the scale is 0 to 38.
For transformations that support precision up to 28 digits, the precision is 1 to 28 digits, and the scale is 0 to 28.
If you specify the precision greater than the maximum number of digits, the Data Integration Service converts decimal values to double in high precision mode.
Float
Double
Precision of 15 digits
group (LIST)
Array
Unlimited number of characters.
Int32
Integer
-2,147,483,648 to 2,147,483,647
Precision of 10, scale of 0
Int64
Bigint
-9,223,372,036,854,775,808 to 9,223,372,036,854,775,807
Precision of 19, scale of 0
Int64 (TIMESTAMP_MILLIS)
Date/Time
Jan 1, 0001 A.D. to Dec 31, 9999 A.D.
Precision of 29, scale of 9
(precision to the nanosecond)
Combined date/time value.
Int96
Date/Time
Jan 1, 0001 A.D. to Dec 31, 9999 A.D.
Precision of 29, scale of 9
(precision to the nanosecond)
Combined date/time value.
Map
Map
Unlimited number of characters.
Struct
Struct
Unlimited number of characters.
Union
Corresponding primitive data type in a union of ["primitive_type", "null"] or ["null", "primitive_type"].
Dependent on primitive data type.

0 COMMENTS

We’d like to hear from you!