Table of Contents

Search

  1. Introduction to Data Discovery
  2. Data Discovery with Informatica Analyst
  3. Data Discovery with Informatica Developer

Data Discovery Guide

Data Discovery Guide

Column Profile Settings

Column Profile Settings

The sampling options determine whether the Analyst tool runs a column profile on all rows of the data sources or limited number of rows.
The following table describes the column profile settings that you can configure for an enterprise discovery profile:
Option
Description
Enable column profiling
Runs a column profile as part of enterprise discovery.
Exclude approved data types and data domains from the data type and data domain inference in the subsequent profile runs
Excludes the approved data type or data domain from data type and data domain inference from the next profile run.
The following table describes the sampling options that you can configure for an enterprise discovery profile:
Option
Description
All Rows
Runs a column profile on all rows in the data source.
First <number> Rows
The number of rows that you want to run the column profile on. The Analyst tool chooses the rows starting from the first row in the data source.
The following table describes the run-time environment option that you can configure for an enterprise discovery profile:
Option
Description
Native
The Analyst tool submits the profile jobs to the Profiling Service Module. The Profiling Service Module then breaks down the profile jobs into a set of mappings. The Data Integration Service runs these mappings and writes the profile results to the profiling warehouse.
Hadoop
The Data Integration Service pushes the profile logic to the Blaze engine on the Hadoop cluster to run profiles.

0 COMMENTS

We’d like to hear from you!