Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Connections
  4. Mappings in the Hadoop Environment
  5. Mapping Objects in the Hadoop Environment
  6. Processing Hierarchical Data on the Spark Engine
  7. Stateful Computing on the Spark Engine
  8. Monitoring Mappings in the Hadoop Environment
  9. Mappings in the Native Environment
  10. Profiles
  11. Native Environment Optimization
  12. Data Type Reference
  13. Complex File Data Object Properties
  14. Function Reference
  15. Parameter Reference

Creating a Single Data Object Profile in Informatica Developer

Creating a Single Data Object Profile in Informatica Developer

You can create a single data object profile for one or more columns in a data object and store the profile object in the Model repository.
  1. In the
    Object Explorer
    view, select the data object you want to profile.
  2. Click
    File
    New
    Profile
    to open the profile wizard.
  3. Select
    Profile
    and click
    Next
    .
  4. Enter a name for the profile and verify the project location. If required, browse to a new location.
  5. Optionally, enter a text description of the profile.
  6. Verify that the name of the data object you selected appears in the
    Data Objects
    section.
  7. Click
    Next
    .
  8. Configure the profile operations that you want to perform. You can configure the following operations:
    • Column profiling
    • Primary key discovery
    • Functional dependency discovery
    • Data domain discovery
    To enable a profile operation, select
    Enabled as part of the "Run Profile" action
    for that operation. Column profiling is enabled by default.
  9. Review the options for your profile.
    You can edit the column selection for all profile types. Review the filter and sampling options for column profiles. You can review the inference options for primary key, functional dependency, and data domain discovery. You can also review data domain selection for data domain discovery.
  10. Review the drill-down options, and edit them if necessary. By default, the
    Enable Row Drilldown
    option is selected. You can edit drill-down options for column profiles. The options also determine whether drill-down operations read from the data source or from staged data, and whether the profile stores result data from previous profile runs.
  11. In the
    Run Settings
    section, choose a run-time environment. Choose
    Native
    ,
    Hive (deprecated)
    , or
    Hadoop
    as the run-time environment. When you choose the Hive or Hadoop option, select a Hadoop connection.
  12. Click
    Finish
    .


Updated November 09, 2018