Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Mappings in the Hadoop Environment
  4. Mapping Sources in the Hadoop Environment
  5. Mapping Targets in the Hadoop Environment
  6. Mapping Transformations in the Hadoop Environment
  7. Processing Hierarchical Data on the Spark Engine
  8. Configuring Transformations to Process Hierarchical Data
  9. Processing Unstructured and Semi-structured Data with an Intelligent Structure Model
  10. Stateful Computing on the Spark Engine
  11. Monitoring Mappings in the Hadoop Environment
  12. Mappings in the Native Environment
  13. Profiles
  14. Native Environment Optimization
  15. Cluster Workflows
  16. Connections
  17. Data Type Reference
  18. Function Reference
  19. Parameter Reference

Big Data Management User Guide

Big Data Management User Guide

Creating a Single Data Object Profile in Informatica Developer

Creating a Single Data Object Profile in Informatica Developer

You can create a single data object profile for one or more columns in a data object and store the profile object in the Model repository.
  1. In the
    Object Explorer
    view, select the data object you want to profile.
  2. Click
    File
    New
    Profile
    to open the profile wizard.
  3. Select
    Profile
    and click
    Next
    .
  4. Enter a name for the profile and verify the project location. If required, browse to a new location.
  5. Optionally, enter a text description of the profile.
  6. Verify that the name of the data object you selected appears in the
    Data Objects
    section.
  7. Click
    Next
    .
  8. Configure the profile operations that you want to perform. You can configure the following operations:
    • Column profiling
    • Primary key discovery
    • Functional dependency discovery
    • Data domain discovery
    To enable a profile operation, select
    Enabled as part of the "Run Profile" action
    for that operation. Column profiling is enabled by default.
  9. Review the options for your profile.
    You can edit the column selection for all profile types. Review the filter and sampling options for column profiles. You can review the inference options for primary key, functional dependency, and data domain discovery. You can also review data domain selection for data domain discovery.
  10. Review the drill-down options, and edit them if necessary. By default, the
    Enable Row Drilldown
    option is selected. You can edit drill-down options for column profiles. The options also determine whether drill-down operations read from the data source or from staged data, and whether the profile stores result data from previous profile runs.
  11. In the
    Run Settings
    section, choose a run-time environment. Choose
    Native
    ,
    Hive (deprecated)
    , or
    Hadoop
    as the run-time environment. When you choose the Hive or Hadoop option, select a Hadoop connection.
  12. Click
    Finish
    .

0 COMMENTS

We’d like to hear from you!