Table of Contents

Search

  1. Preface
  2. Part 1: Introduction to Data Discovery
  3. Part 2: Data Discovery with Informatica Analyst
  4. Part 3: Data Discovery with Informatica Developer
  5. Appendix A: Function Support Based on Profiling Warehouse Connection

Data Discovery Guide

Data Discovery Guide

Data Domain Inference Options in Informatica Analyst

Data Domain Inference Options in Informatica Analyst

Inference options determine whether data domain discovery must run on column data, column name, or both. You can specify the maximum number of source rows the profile can analyze. You can choose a conformance criteria for data domain discovery. You can exclude null values from data domain discovery. You can set the data domain inference options in the
Specify Settings
screen in the profile wizard.
The following table describes the inference options for data domain discovery:
Option
Description
Data
Runs the profile on column data.
Columns
Runs the profile on column titles.
Data and Columns
Runs the profile on both column data and column titles.
Minimum percentage of rows
The minimum conformance percentage of rows in the data set required for a data domain match.
Minimum number of rows
The minimum number of rows in the data set required for a data domain match.
Exclude null values for data domain discovery
Excludes the null values from the data set for data domain discovery.
Edit
Select the columns for data domain discovery.
All Rows
Runs the profile on all rows from the source.
Sample first
Choose maximum number of rows the profile can run on. The Analyst tool chooses the rows starting from the first row in the source. You can choose a maximum of 2,147,483,647 rows.
Random sample
Choose a random sample of rows from the data source. You can choose a maximum of 2,147,483,647 rows.
Random sample (auto)
The Analyst tool chooses a random sample of rows based on the size of the data source.
Exclude approved data types and data domains from the data type and data domain inference in the subsequent profile runs
Excludes the approved data type or data domain from data type and data domain inference from the next profile run.

0 COMMENTS

We’d like to hear from you!