Table of Contents

Search

  1. Preface
  2. Introduction to Test Data Management
  3. Test Data Manager
  4. Projects
  5. Policies
  6. Data Discovery
  7. Data Subset
  8. Data Masking
  9. Data Masking Techniques and Parameters
  10. Data Generation
  11. Data Generation Techniques and Parameters
  12. Data Sets
  13. Plans and Workflows
  14. Monitor
  15. Reports
  16. ilmcmd
  17. tdwcmd
  18. tdwquery
  19. Data Type Reference
  20. Data Type Reference for Test Data Warehouse
  21. Data Type Reference for Hadoop
  22. Glossary

Hadoop Data Source Settings

Hadoop Data Source Settings

Enter source and target connections for the Hadoop plan.
The following table describes connection options:
Connection Options
Description
Source Connection
Required. A connection to the source database. Select a source connection from the list. When you create a Hadoop plan, you can select Oracle, DB2, Sybase, Microsoft SQL Server, Hive, flat file, or HDFS connections.
Target Connection
Required. When you create a Hadoop plan, you can select a relational or an HDFS target connection from the list. When you select a relational target connection type, you can select the Hive connection.
Output Filename
The name of the target file. Required if the target is HDFS. Default extension is .csv.
Column Delimiter
A character that separates columns from each other in the .csv file. Required if the source is a flat file. Default is a comma (,).
Row Separator
A character that separates columns from each other in the .csv file. Required if the source is a flat file. Default is a new line.
Truncate Tables
Truncates the table before loading it. By default, this option is selected. You can truncate the tables for Hive connections. You cannot truncate tables for HDFS connections.
Date-time Format String
Date-time format defined in the session properties. You can enter seconds, milliseconds, microseconds, or nanoseconds.
  • Seconds. MM/DD/YYYY HH24:MI:SS
  • Milliseconds. MM/DD/YYYY HH24:MI:SS.MS
  • Microseconds. MM/DD/YYYY HH24:MI:SS.US
  • Nanoseconds. MM/DD/YYYY HH24:MI:SS.NS
Default is microseconds.
Max Parallel Sessions
The maximum number of mappings that can run at the same time. Default number of mappings is 5.
Locale
Sets the locale for data movement and data masking operations.