FileName Port in Amazon S3

When you create a data object read or write operation for Amazon S3 files, the FileName port appears by default.
When the Spark engine writes to Amazon S3 files using the FileName port, the data is written as follows:
  1. At run time, the Data Integration Service creates a separate directory for each value in the FileName port and adds the target files within the directories.
  2. The file rollover process closes the file that is currently being written to and creates a new file based on the configured rollover value.
    You can use the following optional execution parameters to configure rollover:
    • rolloverTime. You can configure rollover for a target file when a certain period of time has elapsed. Specify the rollover time in hours. The default rollover time is 1 hour.
    • rolloverSize. You can configure rollover for a target file when the target file reaches a certain size. Specify the size in GB. The default rollover size is 1 GB.
  3. When a target file reaches the configured rollover value, the target file is rolled over and moved to the specified Amazon S3 target location.
  4. Sub-directories are created in the specified Amazon S3 target location for each value in the FileName port.
  5. The rolled over target files are moved to the sub-directories created for each value in the FileName port in the specified Amazon S3 target location.
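The directory layout described in steps 4 and 5 can be pictured with a minimal sketch. It is illustrative only and is not the Data Integration Service implementation: the function name `group_by_file_name`, the row shape, and the sample target location are all hypothetical, and the sketch does not model actual target file names or the rollover itself.

```python
from collections import defaultdict
from posixpath import join


def group_by_file_name(rows, target_location):
    """Group row payloads under one sub-directory per FileName value.

    rows: iterable of (file_name_value, payload) pairs (hypothetical shape).
    target_location: the Amazon S3 target location, given as a key prefix.
    """
    layout = defaultdict(list)
    for file_name_value, payload in rows:
        # One sub-directory per distinct FileName value under the target location.
        layout[join(target_location, file_name_value)].append(payload)
    return dict(layout)


rows = [("east", "a"), ("west", "b"), ("east", "c")]
layout = group_by_file_name(rows, "bucket/out")
```

In this sketch, rows that share a FileName value land in the same sub-directory, which mirrors how rolled over files are grouped per value in the target location.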
The default rollover is based on file size. You can configure both rollover schemes for an Amazon S3 target file. Whichever event occurs first triggers the rollover. For example, if you set the rollover time to 1 hour and the rollover size to 1 GB, the target service rolls the file over when the file reaches 1 GB, even if the 1 hour period has not elapsed.
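The "first event wins" rule can be expressed as a short check. This is a sketch under stated assumptions rather than the product's logic: `should_roll_over` and its parameter names are hypothetical, and the defaults simply mirror the documented values of 1 hour and 1 GB.

```python
def should_roll_over(elapsed_hours, size_gb,
                     rollover_time_hours=1.0, rollover_size_gb=1.0):
    """Return True when either configured rollover threshold is reached.

    The defaults mirror the documented rolloverTime (1 hour) and
    rolloverSize (1 GB); the function itself is only an illustration.
    """
    time_reached = elapsed_hours >= rollover_time_hours
    size_reached = size_gb >= rollover_size_gb
    # Whichever threshold is reached first triggers the rollover.
    return time_reached or size_reached


size_first = should_roll_over(elapsed_hours=0.5, size_gb=1.0)
time_first = should_roll_over(elapsed_hours=1.0, size_gb=0.1)
neither = should_roll_over(elapsed_hours=0.5, size_gb=0.4)
```

The first call corresponds to the example in the text: the file reaches 1 GB after half an hour, so it is rolled over before the time threshold is met.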
For more information about the FileName port, see the PowerExchange for Amazon S3 User Guide.