Table of Contents

Search

  1. Preface
  2. Introduction to Test Data Management
  3. Test Data Manager
  4. Projects
  5. Policies
  6. Data Discovery
  7. Creating a Data Subset
  8. Performing a Data Masking Operation
  9. Data Masking Techniques and Parameters
  10. Data Generation
  11. Data Generation Techniques and Parameters
  12. Working with Test Data Warehouse
  13. Analyzing Test Data with Data Coverage
  14. Plans and Workflows
  15. Monitor
  16. Reports
  17. ilmcmd
  18. tdwcmd
  19. tdwquery
  20. Appendix A: Data Type Reference
  21. Appendix B: Data Type Reference for Test Data Warehouse
  22. Appendix C: Data Type Reference for Hadoop
  23. Appendix D: Glossary

Data Type Reference for Hadoop Overview

Data Type Reference for Hadoop Overview

You can perform data movement, data domain discovery, and data masking operations on Hadoop data sources.
Use Hive and HDFS connections in a Hadoop plan to perform data movement, data domain discovery, and data masking operations. When you generate and run the Hadoop plan, TDM generates the mappings and the Data Integration Service pushes the mappings to the Hadoop cluster to improve the performance.
Use a Hadoop HDFS connection in a TDM plan to perform data group movement and data masking operations. When you run a TDM plan with the Hadoop HDFS connection, TDM uses PowerCenter to run the mappings.
When the target is Hive, HDFS, or Hadoop HDFS, TDM supports the data types for the following source connections:
  • Oracle
  • Microsoft SQL Server
  • DB2 for Linux, UNIX, and Windows
  • Sybase ASE
  • Hive
  • HDFS
  • Hadoop HDFS
  • Flat File