A typical profiling solution includes an Informatica Analyst service, the Data Integration Service configured as a single node or grid, and profiling warehouse.
The following figure describes four deployment architectures of the profiling solution:
The Informatica domain is the administrative unit for the Informatica environment. The domain is a collection of nodes that represent the machines on which the application services run. The Data Integration Service performs data integration tasks for Informatica Analyst and Informatica Developer. The Profiling Service Module runs profiles. The Profiling Service Module can run profiles on different types of source data, such as flat files, relational sources, and non-relational sources. The profiling warehouse stores profile results. The Analyst Service runs the Analyst tool. You must configure Hadoop pushdown properties for the Data Integration Service to run profiles in the Hadoop environment. You can run the profiles on the Blaze engine or Hive engine in the Hadoop environment.