Data Validation Option User Guide

Data Sampling Processing

With data sampling, Data Validation Option runs tests on a sample of the data rather than on the entire source. Either the PowerCenter Integration Service or a relational data source can generate the data sample.
By default, the PowerCenter Integration Service generates the data sample. The PowerCenter Integration Service reads all records from the source when it generates the data sample.
To increase run-time performance, you can configure IBM DB2, Microsoft SQL Server, Oracle, or Teradata to generate the data sample. After the database generates the data sample, the PowerCenter Integration Service reads only the sample data.
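
To illustrate why database-side sampling reduces the number of records read, the following minimal sketch builds a sampling query with each database's native sampling clause. It is an illustration only: this guide does not state the SQL that Data Validation Option issues, and the names SAMPLE_TEMPLATES and build_sample_query and the table name CUSTOMERS are hypothetical.

```python
# Illustrative sketch only. These are the native row-sampling clauses of each
# database; they are NOT taken from Data Validation Option, whose generated
# SQL is not documented in this section.
SAMPLE_TEMPLATES = {
    # IBM DB2: row-level Bernoulli sampling, argument is a percentage
    "db2": "SELECT * FROM {table} TABLESAMPLE BERNOULLI ({pct:g})",
    # Microsoft SQL Server: page-level sampling, so the row count is approximate
    "sqlserver": "SELECT * FROM {table} TABLESAMPLE ({pct:g} PERCENT)",
    # Oracle: SAMPLE clause, argument is a percentage
    "oracle": "SELECT * FROM {table} SAMPLE ({pct:g})",
    # Teradata: SAMPLE clause, argument here is a fraction between 0 and 1
    "teradata": "SELECT * FROM {table} SAMPLE {fraction:g}",
}


def build_sample_query(database: str, table: str, percent: float) -> str:
    """Return a SELECT statement that asks the database for a row sample.

    database: one of the keys in SAMPLE_TEMPLATES (case-insensitive)
    table:    hypothetical table name, inserted as-is (no quoting or escaping)
    percent:  sample size as a percentage, greater than 0 and less than 100
    """
    if not 0 < percent < 100:
        raise ValueError("percent must be greater than 0 and less than 100")
    key = database.lower()
    if key not in SAMPLE_TEMPLATES:
        raise ValueError(f"no sampling template for database: {database}")
    # Each template picks the placeholders it needs; unused ones are ignored.
    return SAMPLE_TEMPLATES[key].format(
        table=table, pct=percent, fraction=percent / 100
    )


if __name__ == "__main__":
    for name in SAMPLE_TEMPLATES:
        print(f"{name}: {build_sample_query(name, 'CUSTOMERS', 10)}")
```

In each case the database returns only the sampled rows, so a reader such as the PowerCenter Integration Service does not need to read every record from the source.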