Document processors are components that convert a complete document from one format to another so that it can be processed.
You can use a document processor as a pre-processor that converts the format of a source document before a transformation. For example, if the source document of a parser is in the PDF format, you might apply the PdfToTxt_4 processor. This converts the source document to text, which is much easier to parse than the binary PDF format.
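The pre-processing idea can be illustrated with a minimal Python sketch. This is not the product's API: `latin1_to_text`, `parse_key_values`, and `run_with_preprocessor` are hypothetical names, and a simple byte decoder stands in for a real processor such as PdfToTxt_4. The sketch only shows the order of operations: the whole document is converted first, and the parser then sees text only.

```python
from typing import Callable, Dict

# Hypothetical type aliases, for illustration only.
DocumentProcessor = Callable[[bytes], str]   # converts a complete document to text
Parser = Callable[[str], Dict[str, str]]     # parses text into structured data


def latin1_to_text(raw: bytes) -> str:
    """Stand-in document processor: converts the entire document to text."""
    return raw.decode("latin-1")


def parse_key_values(text: str) -> Dict[str, str]:
    """Stand-in parser: reads 'key: value' lines from plain text."""
    result: Dict[str, str] = {}
    for line in text.splitlines():
        if ":" in line:
            key, value = line.split(":", 1)
            result[key.strip()] = value.strip()
    return result


def run_with_preprocessor(
    raw: bytes, pre: DocumentProcessor, parser: Parser
) -> Dict[str, str]:
    """Apply the document processor to the whole source, then parse the text."""
    text = pre(raw)
    return parser(text)


if __name__ == "__main__":
    source = b"name: Ada\nrole: engineer\n"
    print(run_with_preprocessor(source, latin1_to_text, parse_key_values))
```

Keeping the conversion step separate from the parser means the same parser can be reused with a different document processor when the source format changes.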
Do not confuse document processors with format preprocessors. For more information about format preprocessors, see Formats Overview.