When you create a data object read or write operation for Microsoft Azure Data Lake Store Gen2 (ADLS Gen2) files, the FileName port appears by default.
When the Spark engine writes to ADLS Gen2 files using a FileName port, it uses the following process to write the data:
The Data Integration Service creates separate directories for each value in the FileName port and adds the target files within the directories.
The file rollover process closes the current file to which data is being written to and creates a new file based on the configured rollover value.
Effective in 10.4.1, you can configure the following rollover parameters at the design-time based on time and size:
Stream Rollover Time in Hours. Specify the rollover time in hours for a target file when a certain period of time has elapsed.
Stream Rollover Size in GB. Specify the size in GB for a target file when the target file reaches a certain size.
Technical preview functionality is supported for evaluation purposes but is unwarranted and is not production-ready. Informatica recommends that you use in non-production environments only. Informatica intends to include the preview functionality in an upcoming release for production use, but might choose not to in accordance with changing market or technical circumstances. For more information, contact Informatica Global Customer Support.
When a target file reaches the configured rollover value, the target file is rolled over and moved to the specified ADLS Gen2 target location.
The Spark engine creates sub-directories in the specified ADLS Gen2 target location for each value in the FileName port.
The Spark engine moves the rolled over target files to the sub-directories created for each value in the FileName port in the ADLS Gen2 target location.
You can configure both rollover schemes for an ADLS Gen2 target file. The Spark engine rolls over to based on the first event that triggers. For example, if you configure rollover time to 1 hour and rollover size to 1 GB, the target service rolls the file over when the file reaches a size of 1 GB even if the 1 hour period has not elapsed.