Table of Contents

Search

  1. Preface
  2. Introduction to Data Engineering Streaming
  3. Data Engineering Streaming Administration
  4. Sources in a Streaming Mapping
  5. Targets in a Streaming Mapping
  6. Streaming Mappings
  7. Transformation in Streaming Mappings
  8. Window Transformation
  9. Appendix A: Connections
  10. Appendix B: Monitoring REST API Reference
  11. Appendix C: Sample Files

FileName Port in Amazon S3

FileName Port in Amazon S3

When you create a data object read or write operation for Amazon S3 files, the FileName port appears by default.
When the Spark engine writes to Amazon S3 files using a FileName port, it uses the following process to write the data:
  1. The Data Integration Service creates separate directories for each value in the FileName port and adds the target files within the directories.
  2. The file rollover process closes the current file to which data is being written to and creates a new file based on the configured rollover value.
    Effective in 10.4.1, you can configure the following optional execution rollover parameters at the design-time based on time and size:
    • Stream Rollover Time in Hours. Specify the rollover time in hours for a target file when a certain period of time has elapsed.
    • Stream Rollover Size in GB. Specify the size in GB for a target file when the target file reaches a certain size.
  3. When a target file reaches the configured rollover value, the Spark engine rolls over and moves the target file to the specified Amazon S3 target location.
  4. The Spark engine creates sub-directories in the specified Amazon S3 target location for each value in the FileName port.
  5. The Spark engine moves the rolled over target files to the sub-directories created for each value in the FileName port in the specified Amazon S3 target location.
By default, the Spark engine rolls over based on file size. You can configure both rollover schemes for an Amazon S3 target file. The Spark engine rolls over to based on the first event that triggers. For example, if you configure rollover time to 1 hour and rollover size to 1 GB, the target service rolls the file over when the file reaches a size of 1 GB even if the 1 hour period has not elapsed.
For more information about FileName port, see the
PowerExchange for Amazon S3 User Guide
.

0 COMMENTS

We’d like to hear from you!