document processor converts Microsoft Word documents to XML.
The processor output is in the UTF-8 encoding. If a transformation receives input from the processor, you must set the input encoding to UTF-8.
This component supports Word version 97 and higher. It accesses its input directly, not through Microsoft Word. You do not need to install Word on the computer.
This component is implemented in Java and requires correct configuration of the Java Runtime Environment (JRE).