User Guide

Hadoop HDFS Data Sources

Create Hadoop HDFS connections in Test Data Manager to perform data group movement and data masking operations through PowerCenter mappings.
A Hadoop HDFS connection type is an HDFS connection that uses PowerCenter mappings to perform a TDM operation. You can import metadata from a Hadoop HDFS connection, and in a TDM plan you can use the connection as a source, a target, or both.
You can perform a write operation in the following cases:
  • Source is a relational database, such as Oracle, Microsoft SQL Server, Sybase, or DB2 for Linux, UNIX, and Windows, and the target is a Hadoop HDFS connection type
  • Source is a flat file and the target is a Hadoop HDFS connection type
  • Both the source and the target are Hadoop HDFS connection types
  • Source is a Hadoop HDFS connection type and the target is a flat file
The Hadoop HDFS connection type has the following limitations:
  • You cannot perform a write operation when the source is a Hadoop HDFS connection type and the target is a relational database.
  • You cannot use a Hadoop HDFS connection type in a Hadoop plan.
  • You cannot use shuffle masking when both the source and the target are Hadoop HDFS connection types.
  • You cannot use a Hadoop HDFS connection type as a variable when you configure a source or a target.
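The supported source and target combinations and the shuffle masking limitation above can be read as a small lookup table. The following Python sketch is illustrative only and is not part of Test Data Manager, PowerCenter, or any Informatica API. The names Endpoint, SUPPORTED_WRITES, and check_hdfs_plan are hypothetical and exist only to restate the rules from this topic in executable form.

```python
from enum import Enum


class Endpoint(Enum):
    """Hypothetical labels for the connection kinds named in this topic."""
    RELATIONAL = "relational database"
    FLAT_FILE = "flat file"
    HDFS = "Hadoop HDFS"


# (source, target) pairs that this topic lists as supported write operations.
SUPPORTED_WRITES = {
    (Endpoint.RELATIONAL, Endpoint.HDFS),
    (Endpoint.FLAT_FILE, Endpoint.HDFS),
    (Endpoint.HDFS, Endpoint.HDFS),
    (Endpoint.HDFS, Endpoint.FLAT_FILE),
}


def check_hdfs_plan(source, target, uses_shuffle_masking=False):
    """Return a list of rule violations; an empty list means none were found.

    Only the rules stated in this topic are checked. Plans that do not
    involve a Hadoop HDFS connection are outside its scope and pass through.
    """
    problems = []
    if Endpoint.HDFS not in (source, target):
        return problems
    if (source, target) not in SUPPORTED_WRITES:
        problems.append(
            f"write from {source.value} to {target.value} is not supported"
        )
    if (
        uses_shuffle_masking
        and source is Endpoint.HDFS
        and target is Endpoint.HDFS
    ):
        problems.append(
            "shuffle masking is not available when both the source and "
            "the target are Hadoop HDFS"
        )
    return problems
```

For example, check_hdfs_plan(Endpoint.HDFS, Endpoint.RELATIONAL) reports one problem, because a Hadoop HDFS source cannot write to a relational target, while check_hdfs_plan(Endpoint.RELATIONAL, Endpoint.HDFS) returns an empty list. The sketch does not model the Hadoop plan or variable limitations, which depend on plan configuration rather than on the source and target pair.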
