Table of Contents

Search

  1. Preface
  2. Introduction to Test Data Management
  3. Test Data Manager
  4. Projects
  5. Policies
  6. Data Discovery
  7. Creating a Data Subset
  8. Performing a Data Masking Operation
  9. Data Masking Techniques and Parameters
  10. Data Generation
  11. Data Generation Techniques and Parameters
  12. Working with Test Data Warehouse
  13. Analyzing Test Data with Data Coverage
  14. Plans and Workflows
  15. Monitor
  16. Reports
  17. ilmcmd
  18. tdwcmd
  19. tdwquery
  20. Appendix A: Data Type Reference
  21. Appendix B: Data Type Reference for Test Data Warehouse
  22. Appendix C: Data Type Reference for Hadoop
  23. Appendix D: Glossary

Data Discovery Overview

Data Discovery Overview

Run profiles to discover source data for data masking and data subset operations.
Before you run data masking, you can discover which columns to mask with the same masking rules. Before you create a data subset, you can discover relationships between tables and the entities for the data subset database. You can apply profiling results instead of manually configuring data subset entities or manually applying a data masking rule to one column at a time.
You can run the following types of profiles:
Foreign key
Determines relationships between tables. When you run the profile, the Data Integration Service evaluates the structure of the source tables and identifies potential table relationships. Select entities from the profile results. TDM adds the logical constraints in the TDM repository, but does not apply these constraints in the source tables. When you generate a workflow from a data subset plan, TDM uses the constraints to create subset data with referential integrity.
Primary key
When you run the profile, the Data Integration Service evaluates the structure of the source tables and identifies potential primary keys. The primary key profile includes the percentage of unique values in a column or combination of columns. Select primary keys from the profile results. TDM adds the primary keys in the TDM repository, but does not add them to the source tables.
Data domain
Identifies the columns that belong in a data domain based on the data value of the column or the column name. Use the results when you apply data masking rules. You can apply a rule to all columns that belong to the data domain instead of manually applying the rule to one column at a time.
Column
Analyzes data quality in selected columns in a table. A column profile determines the characteristics of columns in a data source, such as range of values, list of values, and data patterns. When you create a column profile, select the tables on which you want to run a profile and configure data sampling options.
You can run data domain profiles and primary key profiles at the same time. You can also import and run profiles that you create in Informatica Developer.

0 COMMENTS

We’d like to hear from you!