PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

10.5
- 10.5.6
- 10.5.4
- 10.5.3
- 10.5.2
- 10.5.1
- 10.4.1
- 10.4.0

Back Next

Advanced Properties

You can use the advanced properties to specify data object read operation properties to read data from a

Microsoft Azure Data Lake Storage Gen2

server.

The following table describes the Advanced source column properties of the data object read operation:

Property	Description
Tracing Level	Sets the amount of detail that appears in the log file. You can choose terse, normal, verbose initialization, or verbose data. Default is normal.
Concurrent Threads	Number of concurrent connections to read data from Microsoft Azure Data Lake Storage Gen2 . When reading a large file or object, you can spawn multiple threads to process data. Configure Block Size to partition a large file into smaller parts. Default is 10.
Filesystem Name Override	Overrides the file system name.
Source Type	Select the type of source from which you want to read data. You can select the following source types: File Directory Default is File. The directory read is not applicable to binary files.
Allow Wildcard Characters	Indicates whether you want to use wildcard characters for the source file name when you run a mapping to read data from an Avro, flat, JSON, ORC, or Parquet file. For more information about wildcard characters, see Wildcard Characters.
Directory Override	Overrides the default directory path. You can specify an absolute or a relative directory path. Absolute path. The Data Integration Service searches this directory path in the specified file system. Example of absolute path: Dir1/Dir2 Relative path: The Data Integration Service searches this directory path in the native directory path of the object. Example of relative path: /Dir1/Dir2
File Name Override	Overrides the file name.
Recursive Directory Depth	Specify the number of sub-directory levels you want to read in the directory in the native environment or on the Spark engine. For more information about recursive directory depth, see Reading Files from Subdirectories.
Block Size	Partitions a large file or object into smaller parts each of specified block size. When reading a large file, consider partitioning a large file into smaller parts and configure Concurrent Threads to spawn required number of threads to process data in parallel.
Timeout Interval	The number of seconds to wait when attempting to connect to the server. A timeout will occur if the connection cannot be established in the specified amount of time. Default is 0.
Compression Formats	Compresses data when you read data Microsoft Azure Data Lake Storage Gen2 . You can read the following compressed files: None: no compression format. Gzip: applicable to flat files. The source file extension must be .GZ . Snappy: applicable to complex files. Default is None. If you select a .GZ source object and select None in the compression format, though the mapping runs successfully, data is not written to the target object.

Rename Saved Search

Table of Contents

PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

Advanced Properties

Advanced Properties