PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

10.5.3
- 10.5.4
- 10.5.2
- 10.5.1
- 10.5
- 10.4.1
- 10.4.0

Back Next

Microsoft Azure Data Lake Storage Gen2 Read Use Case

Microsoft Azure Data Lake Storage Gen2
Read Use Case

If you want to read large data sets, the task can take a long time to process. You can configure the following read operation properties to partition the source and read the partitions concurrently, which can optimize performance:

Block Size

: partitions a large file or object into smaller parts each of specified block size. When reading a large file, consider partitioning a large file into smaller parts and configure

Concurrent Threads

to spawn required number of threads to process data in parallel.

Concurrent Threads

: number of concurrent connections to read data from

Microsoft Azure Data Lake Storage Gen2

. When reading a large file or object, you can spawn multiple threads to process data. Default is 10.

You must configure

Block Size

if you want multiple threads to process data in parallel.

The following image shows the source properties for parallel read from a large source file: The images shows read operation properties required to read data in parallel from a large source file.

The images shows read operation properties required to read data in parallel from a large source file.

Rename Saved Search

Table of Contents

PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

PowerExchange for Microsoft Azure Data Lake Storage Gen2 User Guide

Microsoft Azure Data Lake Storage Gen2 Read Use Case

Microsoft Azure Data Lake Storage Gen2 Read Use Case

Microsoft Azure Data Lake Storage Gen2
Read Use Case