Table of Contents

Search

  1. Preface
  2. Data Profiling
  3. Profiles
  4. Profile results
  5. Tuning data profiling task performance
  6. Troubleshooting

Data Profiling

Data Profiling

Viewing scorecards

Viewing scorecards

Use scorecards to measure data quality scores and monitor data quality progress for existing and new profiles.
Click the
View Scorecard
button to view the scorecard dashboard in
Data Governance and Catalog
.
The following table lists the widgets that you can view with the scorecard dashboard:
Widget
Description
Average Latest Scores by Dimensions
Donut charts with round off values of the average latest data quality scores based on dimensions.
Number of Rule Occurrences by Dimensions
Number of rule occurrences for each dimension based on Good, Acceptable, and Not Acceptable threshold values.
Rule Occurrences
Shows the following details of rule occurrences:
  • Latest data quality score
  • Dimension of the rule specification
  • Date and time of latest profile run
  • Total number of rows processed
  • Total number of failed rows
  • Input column or primary data element
  • Preview valid and failed rows. To preview valid or failed rows, hover over the rule occurrence and click the ellipsis button. Then, select
    Preview of Valid Rows
    or
    Preview of Failed Rows
    options.
Every time you run a data profiling task with rule occurrences, the scores on the scorecard dashboard are updated. If you define a rule occurrence but do not execute the profile, then the rule occurrence appears on the scorecard dashboard without any score.
  • Scorecards are created based on a profiling source. If you wish to create a scorecard with rule occurrences from a different source, you must use
    Data Governance and Catalog
    .
  • When you run a data profiling task with a rule occurrence that has rules with multiple input ports, the scorecard dashboard displays the scores corresponding to only one column selected randomly from the input ports.
  • If there are multiple input ports associated with a rule occurrence, the
    Primary Data Element
    on the scorecard dashboard displays the column name randomly from the source that is linked to input ports. For example, if you create a rule occurrence with a rule named
    rs_compare_string
    that has multiple input ports such as
    in_input1
    and
    in_input2
    . Now, if you link FIRSTNAME column with
    in_input1
    input port and LASTNAME column with
    in_input2
    input port, the scorecard dashboard displays the
    Primary Data Element
    randomly. In this case, it can either be FIRSTNAME or LASTNAME.

Example

You are a data analyst. You create and run profiles on a Customer table. You want to check the validity of the data available in the latest profile run.
You perform the following tasks:
  1. Create a rule specification with the appropriate rule logic in
    Data Quality
    and set the dimension to
    Validity
    . When you apply a
    Validity
    dimension to a rule, the output data conforms to defined business rules and falls within allowable parameters when those rules are applied.
  2. Create a profile and associate the rule specification.
  3. Create a rule occurrence on the rule specification with Good, Acceptable, and Not Acceptable threshold values to be considered for scoring.
  4. Save and run the profile.
  5. View the metrics in the
    Data Governance and Catalog
    scorecard dashboard. You can use the metrics to verify the data quality progress in the Customer table.

View stakeholder information

You can view users that have been designated as stakeholders for the rule occurrences on the
Overview
and
Stakeholder
tab in
Data Governance and Catalog
. A stakeholder is an authorized user who is responsible for the rule occurrences, can approve or reject change requests for the occurrence, provide inputs to the properties of the rule occurrence, and are interested in following the asset to monitor changes.
A user who creates a rule occurrence gets assigned as a stakeholder of that particular rule occurrence provided the user has the necessary privilege assigned to the user role. To assign stakeholders to the rule occurrences, the organization administrator must enable the
Data Governance Administrator
privilege for the user role.
For more information about the stakeholders, see the
Asset Details
and
Working with Assets
guides in the
Data Governance and Catalog
documentation.

View notifications for status change of scores

When there is a score status change to the rule occurrence, an alert or notification is generated in
Data Governance and Catalog
. You can view application notifications and receive email notifications for the changes to the rule occurrence status. To configure email notifications, you can click the settings link ( The gear icon 
			 ) on the
Notifications
page, and then enable the
Email Summary
and
Email Event
options for the
Data Quality
notification type.
You can receive notifications for the following changes to the rule occurrences:
  • The status changes from good to not acceptable
  • The status changes from acceptable to not acceptable
  • The status changes from good to acceptable
The users or user groups who are assigned as stakeholders to the rule occurrence receive the notifications. A user with the
Governance Administrator
role gets assigned as a stakeholder for the rule occurrences that they create.
To add a user with a custom role as a stakeholder, the user must ensure that the following permissions and privileges are met:
  • Enable create, read, update, and delete permissions on the data quality assets for Metadata Command Center service.
  • Enable the
    Stakesholdership
    feature for the Data Governance and Catalog service.
For more information about the notifications for scores status, see the
Working with Assets
guide in the
Data Governance and Catalog
documentation.

Download rows of rule occurrences and metrics

You can download rows in rule occurrences and metrics from the scorecard dashboard in
Data Governance and Catalog
. You download a maximum of 100 rows to delimited and legend files. To download rows, click the download link from the
Preview of Valid Rows
window.

0 COMMENTS

We’d like to hear from you!