Data Transformation User Guide

Editor Options

The following table describes the controls and fields in the PdfToTxt_4 table configuration editor.
Control or Field
Description
Zoom In
Make the PDF display larger.
Zoom Out
Make the PDF display smaller.
Fit Width
Display the PDF document according to the width of the window.
Prev Page
Go to the previous page.
Next Page
Go to the next page.
Find
Search for a string in the PDF.
Add Table
Add a table to the configuration.
Rem. Table
Remove a table from the configuration.
Add Column
Add a column border to the current table.
Rem. Column
Delete the currently selected column border.
Process
Apply the current table definitions. Click Process after every table-related and column-related action to apply that action.
Tables
A list of tables defined in the input PDF. You can select a table by clicking it.
Name
Name of the currently selected table.
Table Start
An expression defining the upper left corner of the table.
Table End
An expression defining the first text after the table.
Page Header
An expression defining the end of the page header. Use this option to exclude the header from the table processing.
Page Footer
An expression defining the end of the page footer. Use this option to exclude the footer from the table processing.
Use Regular Expressions
If selected, the processor interprets the Table Start, Table End, Page Header, and Page Footer fields as regular expressions and searches for matching text. If not selected, the processor interprets these fields as literal text.
Recalculate at Runtime
If you select this option, PdfToTxt_4 ignores the table configurations that you specified using the table configuration editor. This feature is useful if the tables in a PDF are simple enough for PdfToTxt_4 to process without special configuration.
For example, suppose a simple PDF financial statement contains a table whose columns may vary slightly from month to month. Select the Recalculate at Runtime option to have PdfToTxt_4 adjust the column widths at runtime.
Recalculate Now
If you have changed the table definition, for example by changing column borders or adding a Page Header or Page Footer, click Recalculate Now to update the table definition.
Page
Number of the PDF page that is currently displayed.
Output as XML
Generates the PdfToTxt_4 output as XML instead of text.
Delimiter
Enter a character to use as the column separator in the text output. The default is a vertical bar (|).
OK
Click to save the table configuration and return to the Developer tool.
Cancel
Click to return to the Developer tool without saving the table configuration.
Table Navigation Aid
The table navigation aid displays the number of times a table is found in the PDF document. An example of a navigation aid is: Table ‘Table 1’ found 2 times. The arrows next to this information let you jump back and forth among the instances of the same table structure.
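
The difference between the literal and regular-expression interpretations of the Table Start, Table End, Page Header, and Page Footer fields can be illustrated with a short Python sketch. This is not how PdfToTxt_4 is implemented, and this guide does not state which regular-expression dialect the processor uses. The function name find_marker, the sample page text, and the use of Python's re module are assumptions made only for illustration.

```python
import re


def find_marker(text, expression, use_regex):
    """Return the (start, end) offsets of the first match, or None.

    Hypothetical helper: mimics a "Use Regular Expressions" switch by
    either compiling the expression as a pattern or escaping it so that
    every character is matched literally.
    """
    if use_regex:
        pattern = re.compile(expression)
    else:
        # re.escape neutralizes special characters such as "." and "\d"
        pattern = re.compile(re.escape(expression))
    match = pattern.search(text)
    return match.span() if match else None


# Invented sample text standing in for the text of one PDF page.
page_text = "Statement for March 2018\nAccount No. 12345\nDate | Amount\n"

# Literal mode: the expression must appear exactly as typed.
literal_hit = find_marker(page_text, "Account No.", use_regex=False)

# Regex mode: one expression matches any month name and four-digit year.
regex_hit = find_marker(page_text, r"Statement for \w+ \d{4}", use_regex=True)

# The same regex text in literal mode finds nothing, because the page
# does not contain the characters "\w+" or "\d{4}" themselves.
literal_miss = find_marker(page_text, r"Statement for \w+ \d{4}", use_regex=False)

# In regex mode "." matches any character; in literal mode it does not.
dot_as_regex = find_marker("Account No5", "Account No.", use_regex=True)
dot_as_literal = find_marker("Account No5", "Account No.", use_regex=False)
```

In this sketch, a regular expression suits a marker whose text changes from document to document, such as a statement date. Literal text is the safer choice when the marker contains characters such as a period that have a special meaning in regular expressions.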


Updated September 26, 2018