Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Connections
  4. Mappings in a Hadoop Environment
  5. Mappings in the Native Environment
  6. Profiles
  7. Native Environment Optimization
  8. POWERCENTERHELP
  9. Data Type Reference

Column Profiles on Hadoop

Column Profiles on Hadoop

You can import a native, Hive, and HDFS data source into the Analyst tool or Developer tool and then run a column profile on it. When you create a column profile, you select the columns, set up filters, and sampling options. Column profile results include value frequency distribution, unique values, null values, and data types.
Complete the following steps to run a column profile on Hadoop:
  1. Open a connection in the Analyst tool or Developer tool to import the native or Hadoop source.
  2. Import the data source as a data object. The Analyst tool or Developer tool saves the data object in the Model repository.
  3. Create a profile on the imported data object.
  4. Set up the configuration options. These options include the run-time settings and Hive connection.
  5. Run the profile to view the results.


Updated July 03, 2018