Table of Contents

Search

  1. Introduction to Data Discovery
  2. Data Discovery with Informatica Analyst
  3. Data Discovery with Informatica Developer
  4. Function Support Based on Profiling Warehouse Connection

Data Discovery Guide

Data Discovery Guide

Creating a Column Profile to Perform Data Domain Discovery in Informatica Analyst

Creating a Column Profile to Perform Data Domain Discovery in Informatica Analyst

You need to create at least one data domain before you can create a column profile to perform data domain discovery in the Analyst tool. The profile can discover both column name and column data that match predefined data domains.
  1. In the
    Discovery
    workspace, click
    Profile
    , or select
    New
    Profile
    from anywhere in the Analyst tool.
    The
    New Profile
    wizard appears.
  2. The
    Single source
    option is selected by default. Click
    Next
    .
  3. In the
    Specify General Properties
    screen, enter a name and an optional description for the profile. In the Location field, select the project or folder where you want to create the profile. Click
    Next
    .
  4. In the
    Select Source
    screen, click
    Choose
    to select a data object, or click
    New
    to import a data object. Click
    Next
    .
  5. In the
    Specify Settings
    screen, choose to run a column profile, data domain discovery, or a column profile and data domain discovery. By default, column profile option is selected.
    • Choose
      Run data domain discovery
      to perform data domain discovery. Select the data domain options in the
      Data Domain
      pane.
    • Choose
      Run column profile
      and
      Run data domain discovery
      to run the column profile and data domain discovery. Select the data domain options in the
      Data domain
      pane.
      By default, the columns that you select for column profile is also applicable to data domain discovery. Click
      Edit
      to select or deselect columns for data domain discovery irrespective of the columns that you select for column profile.
    • Choose Data, Columns, or Data and Columns to run data domain discovery on.
    • Choose a sampling option in the
      Run profile on
      pane.
    • Choose a drilldown option in the
      Drilldown
      pane. Optionally, click
      Select Columns
      to select columns to drill down on. You can choose to omit data type and data domain inference for columns with approved data type or data domain.
    • Choose a conformance criteria, and you can select
      Exclude null values from data domain discovery
      option.
    • Choose
      Native
      , or
      Hadoop
      as the run-time environment. You can choose Blaze, or Spark option in the Hadoop run-time environment. If you choose the Blaze option, click
      Choose
      to select a Hadoop connection in the
      Select a Hadoop Connection
      dialog box. If you choose the Spark option, click
      Choose
      to select a Hadoop connection in the
      Select a Hadoop Connection
      dialog box.
  6. In the
    Specify Rules and Filters
    screen, you can add, edit, or delete rules and filters for the profile.
  7. Click
    Save and Finish
    to create the profile, or click
    Save and Run
    to create and run the profile.

0 COMMENTS

We’d like to hear from you!