Table of Contents

Search

  1. Preface
  2. Introduction to Data Transformation
  3. Data Processor Transformation
  4. Wizard Input and Output Formats
  5. Relational Input and Output
  6. XMap
  7. Libraries
  8. Schema Object
  9. Command Line Interface
  10. Scripts
  11. Parsers
  12. Script Ports
  13. Document Processors
  14. Formats
  15. Data Holders
  16. Anchors
  17. Transformers
  18. Actions
  19. Serializers
  20. Mappers
  21. Locators, Keys, and Indexing
  22. Streamers
  23. Validators, Notifications, and Failure Handling
  24. Validation Rules
  25. Custom Script Components

Data Transformation User Guide

Data Transformation User Guide

Formats Overview

Formats Overview

The
format
property of a Parser defines the format of the documents for the transformation to process. The value of the property is one of the following format components:
BinaryFormat CustomFormat HtmlFormat RtfFormat TextFormat XmlFormat
The format has properties of its own, which further define how the Parser interprets and processes the input.
The following table describes the sub-components that you can nest in a format:
Subcomponent
Description
Delimiter
Defines a hierarchy of characters or strings that organize the information in the document, such as newlines and tabs.
Format preprocessor
Cleans up the source before the Parser starts searching for anchors.
Default transformer
Performs predefined operations on the output of each anchor.