Table of Contents

Search

  1. Preface
  2. Introduction to Data Transformation
  3. Data Processor Transformation
  4. Wizard Input and Output Formats
  5. Relational Input and Output
  6. XMap
  7. Libraries
  8. Schema Object
  9. Command Line Interface
  10. Scripts
  11. Parsers
  12. Script Ports
  13. Document Processors
  14. Formats
  15. Data Holders
  16. Anchors
  17. Transformers
  18. Actions
  19. Serializers
  20. Mappers
  21. Locators, Keys, and Indexing
  22. Streamers
  23. Validators, Notifications, and Failure Handling
  24. Validation Rules
  25. Custom Script Components

Data Transformation User Guide

Data Transformation User Guide

Regular Expression Syntax

Regular Expression Syntax

A regular expression defines a search pattern according to a standard syntax.
The Data Processor transformation uses the Regex++ implementation of regular expressions, © 1998-2003 by Dr. John Maddock, Version 1.33, 18 April 2000.
Regex++ does not support locales.
The following table lists some special characters that you can use in regular expressions:
Character
Meaning
Example
*
Matches zero or more instances of the preceding character.
ab*c
matches
ac
,
abc
, or
abbbc
.
?
Matches zero or one instance of the preceding character.
ab?c
matches
ac
or
abc
.
+
Matches one or more instances of the preceding character.
a+
matches
a
or
aaaa
.
{}
Matches the specified number of instances of the preceding character.
ab{2}c
matches
abbc
.
[]
Matches any of a set of characters.
a[bst]c
matches
abc
,
asc
, or
atc
.
-
Defines a range of characters inside square brackets.
[A-Za-z]
matches any character in the English alphabet.
[A-Za-zü]
matches any character in the English alphabet or the German
ü
.
.
Matches any single character.
a.c
matches
abc
,
a c
, or
a1c
.
^
Matches the start of the input text.
^P.
matches
Pe
but not
Pi
in "Peter Piper."
$
Matches the end of the input text.
r.$
matches
rs
in "Peter Piper's peppers."
|
Matches either of two expressions.
abc|ded
matches
abc
or
def
.
()
Grouping
A(abc|def)
matches
Aabc
or
Adef
.
\
Escapes one of the other special characters, treating it as a literal character.
\.
matches a literal period, rather than any character.


Updated September 26, 2018