Table of Contents

Search

  1. Preface
  2. Data Profiling
  3. Profiles
  4. Profile results
  5. Tuning data profiling task performance
  6. Troubleshooting

Data Profiling

Data Profiling

Columns

Columns

The
Columns
tab displays the columns that are supported by
Data Profiling
. You can select or clear the columns on the
Columns
tab. The profile runs on the selected columns to extract column statistics.
Data Profiling
supports the following data types and column precision:
  • Non-numeric data types. Supports columns with a precision from 0 through 4000.
    By default,
    Data Profiling
    selects the columns with a precision from 0 through 150 on the
    Columns
    tab.
  • Numeric data types. Supports columns with a precision from 0 through 38.
    By default,
    Data Profiling
    selects the columns with a precision from 0 through 15 on the
    Columns
    tab.
If you select columns with precision greater than 255 in the
Columns
tab, Data Profiling truncates the value frequency and calculates the statistics based on the first 255 characters on the
Results
page.
The following table lists the properties that you can view on the
Columns
tab:
Property
Description
Columns
The column names in the selected source object.
Type
The data type that appears in the transformations. They are internal data types based on ANSI SQL-92 generic data types, which the Secure Agent uses to move data across platforms. Transformation data types appear in all transformations in a mapping
Precision
The number of characters in a column.
Native Data Type
The data type specific to the source database or flat files.
Scale
The number of numeric characters after the decimal point.
Nullable
Indicates whether the column can accommodate a NULL value.
Key
Indicates whether the column has been designated as a primary key in the data source.
To sort the list of columns in ascending or descending order, click the column name or a property name. Use the
Find
field to search for columns.
When a column is added or deleted in the data source, the list of columns on the
Columns
tab gets updated when you edit the profile. The changes appear only if the runtime environment is up and running. When a column is added, the column appears on the
Columns
tab and is unmarked. You can choose the column for the next profile run. When a column is deleted, the deleted column does not appear on the
Columns
tab.

Example

You are a data steward user. You have created and run a profile called
CustP
on the Customer table. You want to modify the profile based on a business need to classify customers for a new rewards program. To accomplish this task, add the rule to the profile and select the columns in the profile that meet the business need.
To add the rules and select the relevant columns in the profile, perform the following steps:
  1. Modify the
    CustP
    profile.
  2. Choose the columns that meet the business need.
  3. In
    Data Quality
    , create a rule specification for the business need.
  4. In
    Data Profiling
    , add the rule specification to the profile.
  5. Run the profile.
  6. View the profile results to measure the quality of data.
  7. Export the results to a Microsoft Excel file for further analysis.

0 COMMENTS

We’d like to hear from you!