Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Connections
  4. Mappings in a Hadoop Environment
  5. Mappings in the Native Environment
  6. Profiles
  7. Native Environment Optimization
  8. POWERCENTERHELP
  9. Data Type Reference

Running Multiple Data Object Profiles on Hadoop

Running Multiple Data Object Profiles on Hadoop

You can run a column profile on multiple data source objects in the Developer tool. The Developer tool uses default column profiling options to generate the results for multiple data sources.
  1. In the
    Object Explorer
    view, select the data objects you want to run a profile on.
  2. Click
    File
    New
    Profile
    to open the
    New Profile
    wizard.
  3. Select
    Multiple Profiles
    and click
    Next
    .
  4. Select the location where you want to create the profiles. You can create each profile at the same location of the data object, or you can specify a common location for the profiles.
  5. Verify that the names of the data objects you selected appear within the
    Data Objects
    section.
    Optionally, click
    Add
    to add another data object.
  6. Optionally, specify the number of rows to profile, and choose whether to run the profile when the wizard completes.
  7. Click
    Next
    .
    The
    Run Settings
    pane appears. You can specify the Hadoop settings.
  8. Select
    Hadoop
    and select a Hive connection.
    You can select both
    Native
    and
    Hadoop
    as the validation environments.
  9. In the
    Run-time Environment
    field, select
    Hadoop
    .
  10. Click
    Finish
    .
  11. Optionally, enter prefix and suffix strings to add to the profile names.
  12. Click
    OK
    .


Updated July 03, 2018