Wildcard Characters for Reading Data from Complex Files
Wildcard Characters for Reading Data from Complex Files
When you run a mapping in the native environment or on the Spark engine to read data from an Avro, JSON, ORC, or Parquet file, you can use an asterisk (?) and (*) wildcard characters to specify the source file name.
You can use the following wildcard characters:
? (Question mark)
The question mark character (?) allows one occurrence of any character. For example, if you enter the source file name as
a?b.txt
, the Data Integration Service reads data from files with the following names:
a1b.txt
a2b.txt
aab.txt
acb.txt
* (Asterisk)
The asterisk mark character (*) allows zero or more than one occurrence of any character. If you enter the source file name as
a*b.txt
, the Data Integration Service reads data from files with the following names:
aab.txt
a1b.txt
ab.txt
abc11b.txt
When you read data from the Avro, JSON, ORC, or Parquet file that contains a colon (:) character in the file name, the mapping fails.