Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Connections
  4. Mappings in a Hadoop Environment
  5. Mappings in the Native Environment
  6. Profiles
  7. Native Environment Optimization
  8. POWERCENTERHELP
  9. Data Type Reference

Running a Profile on Hadoop in the Analyst Tool

Running a Profile on Hadoop in the Analyst Tool

When you create or edit a profile in the Analyst tool, you can select the run-time environment.
  1. In the
    Discovery Home
    panel, click
    Data Object Profile
    or select
    New
    Data Object profile
    from anywhere in the Analyst tool.
    The
    New Profile
    wizard appears. The
    Column profiling
    option is selected by default.
  2. Click
    Next
    .
  3. In the
    Sources
    pane, select a data object.
  4. Click
    Next
    .
  5. Enter a name and an optional description for the profile.
  6. In the
    Folders
    pane, select the project or folder where you want to create the profile.
    The Analyst tool displays the project that you selected and shared projects that contain folders where you can create the profile. The profiles in the folder appear in the right pane.
  7. Click
    Next
    .
  8. In the
    Columns
    pane, select the columns that you want to run a profile on. The columns include any rules that you applied to the profile. The Analyst tool lists column properties, such as the name, data type, precision, and scale for each column.
    Optionally, select
    Name
    to select all columns.
  9. In the
    Sampling Options
    pane, configure the sampling options.
  10. In the
    Drilldown Options
    pane, configure the drill-down options.
    Optionally, click
    Select Columns
    to select columns to drill down on. In the
    Drilldown columns
    dialog box, select the columns for drilldown and click
    OK
    .
  11. Accept the default option in the
    Profile Results Option
    pane.
    The first time you run the profile, the Analyst tool displays profile results for all columns selected for profiling.
  12. Click
    Next
    .
  13. Optionally, define a filter for the profile.
  14. Click
    Next
    to verify the row drill-down settings including the preview columns for drilldown.
  15. To run the profile in the Hadoop environment, select
    Hive
    and then select a hive connection. The Hive connection helps the Data Integration Service communicate with the Hadoop cluster to push down the profile execution from the Data Integration Service to the Hadoop cluster.
  16. Click
    Save
    to create the profile, or click
    Save & Run
    to create the profile and then run the profile.


Updated July 03, 2018