PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

Advanced Properties

Use the advanced properties of the data object write operation to control how data is written to Microsoft Azure Data Lake Storage Gen2.
The following table describes the advanced properties that you configure for a Microsoft Azure Data Lake Storage Gen2 write operation:
Property
Description
Tracing Level
By default, the tracing level for every transformation is Normal. Change the tracing level to a Verbose setting when you need to troubleshoot a transformation that is not behaving as expected. Set the tracing level to Terse when you want the minimum amount of detail to appear in the log.
Maintain row order
Not applicable.
Concurrent Threads
Number of concurrent connections used to write data to Microsoft Azure Data Lake Storage Gen2. When writing a large file or object, you can spawn multiple threads to process data. Configure Block Size to partition a large file into smaller parts. Default is 10.
Filesystem Name Override
Overrides the default file system name.
Directory Override
Overrides the default directory path. You can specify an absolute or a relative directory path.
  • Absolute path. The Data Integration Service searches this directory path in the specified file system. Example of absolute path:
    Dir1/Dir2
  • Relative path. The Data Integration Service searches this directory path in the native directory path of the object. Example of relative path:
    /Dir1/Dir2
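The following Python sketch illustrates one way to read the two forms. It is not the Data Integration Service implementation. It assumes, as the two examples above suggest, that a leading slash marks the relative form and that a relative path is appended to the native directory path of the object. The function name resolve_directory and the parameter native_directory are hypothetical.

```python
import posixpath


def resolve_directory(override, native_directory):
    """Return the directory to write to, relative to the file system root.

    Assumption (not confirmed by the guide): a leading "/" marks the
    relative form, which is appended to the object's native directory.
    """
    if override.startswith("/"):
        # Relative form: resolve under the native directory of the object.
        return posixpath.join(native_directory.strip("/"), override.strip("/"))
    # Absolute form: use the path as-is within the specified file system.
    return override.strip("/")
```

For example, under these assumptions, resolve_directory("/Dir1/Dir2", "base/in") yields base/in/Dir1/Dir2, while resolve_directory("Dir1/Dir2", "base/in") yields Dir1/Dir2.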
File Name Override
Overrides the file name.
Write Strategy
If the target exists, overwrites the existing file or appends the data to it. You can also configure the mapping to fail if the target already exists. Append is applicable only to flat files in the native environment.
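The three behaviors can be illustrated with a minimal Python sketch that writes to a local file instead of Microsoft Azure Data Lake Storage Gen2. The function name write_target and the strategy labels "overwrite", "append", and "fail" are hypothetical and are not the product's property values.

```python
def write_target(path, data, strategy):
    """Write bytes to path according to a hypothetical write strategy label."""
    modes = {
        "overwrite": "wb",  # replace the existing content
        "append": "ab",     # add the data to the end of the existing file
        "fail": "xb",       # raise FileExistsError if the target already exists
    }
    if strategy not in modes:
        raise ValueError(f"unknown write strategy: {strategy!r}")
    with open(path, modes[strategy]) as target:
        target.write(data)
```

The exclusive-creation mode "xb" is used for the fail case so that the existence check and the file creation happen in a single step.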
Block Size
Partitions a large file or object into smaller parts, each of the specified block size. When writing a large file, consider partitioning the file into smaller parts and configure Concurrent Threads to spawn the required number of threads to process the data in parallel.
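The interaction between Block Size and Concurrent Threads can be sketched in Python as follows. This is an illustration of the general technique, not the connector's implementation: split_into_blocks, write_blocks, and the upload_block callback are hypothetical names, and the sketch assumes that the block size is expressed in bytes.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(total_size, block_size):
    """Return (offset, length) pairs that cover total_size bytes."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return [
        (offset, min(block_size, total_size - offset))
        for offset in range(0, total_size, block_size)
    ]


def write_blocks(data, block_size, concurrent_threads, upload_block):
    """Send each block through upload_block using a pool of worker threads.

    upload_block(offset, chunk) is a caller-supplied (hypothetical) function
    that writes one block to the target. Returns the number of blocks sent.
    """
    blocks = split_into_blocks(len(data), block_size)
    with ThreadPoolExecutor(max_workers=concurrent_threads) as pool:
        futures = [
            pool.submit(upload_block, offset, data[offset:offset + length])
            for offset, length in blocks
        ]
        for future in futures:
            future.result()  # re-raise any error from a worker thread
    return len(blocks)
```

In this sketch, a 10-byte payload with a block size of 4 produces three blocks. The number of threads caps how many of those blocks are in flight at the same time.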
Compression Format
Compresses data when you write data to Microsoft Azure Data Lake Storage Gen2. You can compress the data in the following formats:
  • None. No compression format.
  • Gzip. Applicable only to flat files. The target file extension must be .GZ.
  • Snappy. Applicable only to Avro, ORC, and Parquet file formats in Microsoft Azure Data Lake Storage Gen2.
Default is None.
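The compatibility rules above can be expressed as a small validation check. The following Python sketch is illustrative only: the function name validate_compression and the file format labels are hypothetical, and the case-insensitive comparison of the .GZ extension is an assumption that the guide does not confirm.

```python
def validate_compression(compression, file_format, target_name):
    """Return True if the combination follows the rules described above.

    Raises ValueError otherwise. file_format is a hypothetical label such as
    "flat", "avro", "orc", or "parquet".
    """
    compression = compression.lower()
    file_format = file_format.lower()
    if compression == "none":
        return True
    if compression == "gzip":
        if file_format != "flat":
            raise ValueError("Gzip applies only to flat files")
        # Assumption: the extension check ignores letter case.
        if not target_name.lower().endswith(".gz"):
            raise ValueError("Gzip targets must use the .GZ extension")
        return True
    if compression == "snappy":
        if file_format not in {"avro", "orc", "parquet"}:
            raise ValueError("Snappy applies only to Avro, ORC, and Parquet")
        return True
    raise ValueError(f"unknown compression format: {compression!r}")
```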
Timeout Interval
The number of seconds to wait when attempting to connect to the server. A timeout will occur if the connection cannot be established in the specified amount of time. Default is 0.
Stream Rollover Size in GB
Not applicable.
Stream Rollover Time in hours
Not applicable.
Interim Directory
Not applicable.
Partition Option
Select one of the following partition options when you configure a dynamic mapping on the Spark engine:
  • None. Partitioning is not configured.
  • Last N Columns Partitioned. The last N columns are selected for partitioning.
  • Partition Column Names. Comma-separated column names are selected for partitioning.
Partition Arguments
The number or names of partition columns.
If you select None as the partition option, do not specify a partition argument.
If you select Last N Columns Partitioned as the partition option, specify an integer value as the partition argument.
If you select Partition Column Names as the partition option, specify comma-separated column names as the partition argument.
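The relationship between the partition option and the partition argument can be sketched as follows. This Python example is a hedged illustration, not the Spark engine logic: the function name resolve_partition_columns is hypothetical, and the checks that N must not exceed the number of columns and that each named column must exist in the target are assumptions.

```python
def resolve_partition_columns(option, argument, columns):
    """Return the list of partition columns for the given option and argument."""
    argument = (argument or "").strip()
    if option == "None":
        if argument:
            raise ValueError("do not specify a partition argument with None")
        return []
    if option == "Last N Columns Partitioned":
        count = int(argument)  # raises ValueError if not an integer
        if not 0 < count <= len(columns):
            raise ValueError("N must be between 1 and the number of columns")
        return list(columns[-count:])
    if option == "Partition Column Names":
        names = [name.strip() for name in argument.split(",") if name.strip()]
        if not names:
            raise ValueError("specify at least one column name")
        unknown = [name for name in names if name not in columns]
        if unknown:
            raise ValueError(f"unknown partition columns: {unknown}")
        return names
    raise ValueError(f"unknown partition option: {option!r}")
```

For example, with the columns id, region, and year, the option Last N Columns Partitioned with the argument 2 selects region and year.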
