Table of Contents

Search

  1. Preface
  2. Data Profiling
  3. Profiles
  4. Profile results
  5. Tuning data profiling task performance
  6. Troubleshooting

Data Profiling

Data Profiling

Edit a profile

Edit a profile

You can edit a profile for the next profile run. You can change the profile definition, add or remove filters, add or remove rules, choose another runtime environment, edit schedule details, edit email address for notifications, and edit advanced options.

Definition

On the
Definition
tab, you can edit the following options for the next profile run:
Asset Details
Change or edit the
Name
and
Description
options.
The
Asset Details
area shows the location of the asset, user who created the profile, timestamp of the profile run, and used storage.
The
Used Storage (MB)
field shows the storage space consumed in the profiling warehouse for the profile results that you view on the
Results
tab. The storage space depends on the sampling option, columns, filter, and advanced options that you select for the profile run. It also depends on the identified number of unique values and outliers.
Data Profiling
stores the profile results in the profiling warehouse. The profiling warehouse is an Informatica Intelligent Cloud Services repository where
Data Profiling
stores the profile results.
Connection and Source Details
Switch between connections of the same database type in a profile definition.
Data Profiling
displays all the connections in the profile definition and does validation checks to validate the connection that you select. Choose a different connection or source object for the next profile run. The following list describes the different combinations with which you can edit a connection or source object:
  • If you retain the same connection and choose a different source object that includes the same details as the previous source object,
    Data Profiling
    preserves the configuration settings of the columns that you select to profile, filters, and rules.
  • If you retain the same connection and choose a different source object that does not include the same details as the previous source object, you need to select the columns to profile, and then fix the filters and rules that are not valid.
  • If you choose a different connection and source object, you need to select the columns to profile, configure the filters and rules again from scratch.
  • If you choose a different connection and a source object with same name and includes the same details as the previous source object,
    Data Profiling
    preserves the configuration settings of the columns that you select to profile, filters, and rules.
Profile Settings
Change
Run profile on
or
Drill down
options.
Columns
Select or clear one or more columns.
Filters
Choose a different filter. Optionally, you can create, add, or delete filters.

Rules

On the
Rules
tab, you can choose the rules for the next profile run. Optionally, you can add, or delete rules. When you change the source object,
Data Profiling
automatically assigns rules if the source object attributes match the configuration file parameters. You can include or exclude the rules to the profile.

Schedule

On the
Schedule
tab, you can edit the following options:
Schedule Details
Change the runtime environment and choose a schedule for the next profile run.
Email Notification Options
Change or edit the email notification options.
Advanced Options
Edit the advanced options. For more information about the advanced options, see Advanced options.
For Databricks Delta, you can select or clear the option to run the profiling task in advanced mode. You cannot edit the option after you save the task.

0 COMMENTS

We’d like to hear from you!