Creating a Column Profile to Perform Data Domain Discovery in Informatica Analyst
You must create at least one data domain before you can create a column profile to perform data domain discovery in the Analyst tool. The profile can discover both column names and column data that match predefined data domains.
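To make the distinction between matching on column names and matching on column data concrete, the following is a minimal conceptual sketch in Python. It is not how the Analyst tool implements discovery. The `DataDomain` class, its two regular-expression fields, and the sample ZIP code patterns are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class DataDomain:
    """Hypothetical stand-in for a predefined data domain (not an Informatica API)."""
    name: str
    column_name_pattern: str  # regex applied to the column name
    data_pattern: str         # regex applied to each column value


def matches_column_name(domain: DataDomain, column_name: str) -> bool:
    """Return True if the column name matches the domain's name rule."""
    return re.search(domain.column_name_pattern, column_name, re.IGNORECASE) is not None


def matching_values(domain: DataDomain, values: list[str]) -> int:
    """Count the values that fully match the domain's data rule."""
    return sum(1 for v in values if re.fullmatch(domain.data_pattern, v))


# Illustrative domain: US-style ZIP codes (patterns are assumptions for this example).
zip_domain = DataDomain(
    name="ZIP_Code",
    column_name_pattern=r"zip|postal",
    data_pattern=r"\d{5}(-\d{4})?",
)

# Small made-up data set: one column that matches by name and data,
# and one that matches only on a single value.
columns = {
    "cust_zip": ["94105", "10001-1234", "n/a"],
    "notes": ["call back", "94105", "vip"],
}

for column_name, values in columns.items():
    by_name = matches_column_name(zip_domain, column_name)
    by_data = matching_values(zip_domain, values)
    print(f"{column_name}: name match={by_name}, data matches={by_data}/{len(values)}")
```

In this sketch, a column can match a domain by name, by data, or by both, which mirrors the choice between Data, Columns, or Data and Columns that the profile settings offer.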
In the Discovery workspace, click Profile, or select New > Profile from anywhere in the Analyst tool.
The New Profile wizard appears.
The Single source option is selected by default. Click Next.
In the Specify General Properties screen, enter a name and an optional description for the profile. In the Location field, select the project or folder where you want to create the profile. Click Next.
In the Select Source screen, click Choose to select a data object, or click New to import a data object. Click Next.
In the Specify Settings screen, choose to run a column profile, data domain discovery, or both. By default, the column profile option is selected.
Choose Run data domain discovery to perform data domain discovery. Select the data domain options in the Data Domain pane.
Choose Run column profile and Run data domain discovery to run the column profile and data domain discovery. Select the data domain options in the Data Domain pane.
By default, the columns that you select for the column profile also apply to data domain discovery. Click Edit to select or deselect columns for data domain discovery independently of the columns that you select for the column profile.
Choose Data, Columns, or Data and Columns to run data domain discovery on.
Choose a sampling option in the Run profile on pane.
Choose a drilldown option in the Drilldown pane. Optionally, click Select Columns to select columns to drill down on. You can choose to omit data type and data domain inference for columns that have an approved data type or data domain.
Choose a conformance criterion. Optionally, select the Exclude null values from data domain discovery option.
Choose Native or Hadoop as the run-time environment. In the Hadoop run-time environment, you can choose the Blaze or Spark option. For either option, click Choose to select a Hadoop connection in the Select a Hadoop Connection dialog box.
In the Specify Rules and Filters screen, you can add, edit, or delete rules and filters for the profile.
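The effect of the conformance criterion and the Exclude null values from data domain discovery option chosen in the Specify Settings screen can be pictured with a small Python sketch. It assumes that conformance is expressed as a minimum percentage of matching values, and that excluding nulls removes them from the row count before the percentage is computed. Both are assumptions made for illustration, and the function names and sample phone pattern are hypothetical rather than part of the Analyst tool.

```python
import re
from typing import Optional, Sequence


def conformance_percent(
    values: Sequence[Optional[str]],
    data_pattern: str,
    exclude_nulls: bool = True,
) -> float:
    """Percentage of considered values that fully match data_pattern.

    With exclude_nulls=True, None values are dropped before the percentage
    is computed; otherwise they count as non-matching rows.
    """
    considered = [v for v in values if v is not None] if exclude_nulls else list(values)
    if not considered:
        return 0.0
    hits = sum(
        1 for v in considered
        if v is not None and re.fullmatch(data_pattern, v)
    )
    return 100.0 * hits / len(considered)


def conforms(percent: float, minimum_percent: float) -> bool:
    """Assumed criterion: the column conforms if it reaches the minimum percentage."""
    return percent >= minimum_percent


# Made-up sample column with two nulls and one non-matching value.
phone_pattern = r"\d{3}-\d{4}"
sample = ["555-0100", None, "555-0199", None, "unknown"]

with_nulls = conformance_percent(sample, phone_pattern, exclude_nulls=False)
without_nulls = conformance_percent(sample, phone_pattern, exclude_nulls=True)

print(f"nulls counted:  {with_nulls:.1f}% conforms={conforms(with_nulls, 50.0)}")
print(f"nulls excluded: {without_nulls:.1f}% conforms={conforms(without_nulls, 50.0)}")
```

Under these assumptions, the same column fails a 50 percent threshold when nulls are counted and passes it when they are excluded, which is why the option can change which data domains a column is reported against.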