Big Data Management Administrator Guide

Data Integration Service Process

The Data Integration Service starts one or more Data Integration Service processes to manage requests to run mapping jobs in the Hadoop environment.
When you run mappings in the Hadoop environment, the following components run within the Data Integration Service process:
  • Mapping Service Module. The Mapping Service Module receives requests to run mappings from clients.
  • Logical Data Transformation Manager (LDTM). The LDTM compiles and optimizes mapping jobs, and it generates the execution workflow that is used to run a mapping on a Hadoop cluster.
  • Workflow Executor Service. The Workflow Executor Service is a part of the Data Transformation Manager (DTM). The Data Integration Service uses the Workflow Executor Service to push jobs to a Hadoop cluster.
The following diagram shows how the components interact with the client, the Hadoop cluster, and the Model Repository Service:
  1. A client submits a mapping execution request to the Data Integration Service. The Mapping Service Module receives the request and stores the job in the queue.
  2. The Mapping Service Module connects to the Model Repository Service to fetch mapping metadata from the Model repository.
  3. The Mapping Service Module passes the mapping to the Logical Data Transformation Manager (LDTM).
  4. The LDTM compiles the mapping and generates the Spark execution workflow. It stores the execution workflow in the Model repository.
  5. The LDTM pushes the execution workflow through the Workflow Executor Service to the cluster for processing.
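The five steps above can be sketched as a small model of the request flow. This is only an illustration, not Informatica code or an Informatica API: the class names mirror the components named in this section, but every method name, the dictionary layout of the mapping metadata, and the mapping name `m_orders` are assumptions made for the example.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ModelRepository:
    """Hypothetical stand-in for the Model repository."""
    mappings: dict = field(default_factory=dict)   # mapping name -> metadata
    workflows: dict = field(default_factory=dict)  # mapping name -> execution workflow


class WorkflowExecutorService:
    """Hypothetical stand-in that records what would be pushed to the cluster."""

    def __init__(self):
        self.pushed = []

    def push(self, workflow):
        self.pushed.append(workflow)


class LDTM:
    """Hypothetical stand-in for the Logical Data Transformation Manager."""

    def __init__(self, repository, executor):
        self.repository = repository
        self.executor = executor

    def run(self, name, metadata):
        # Step 4: compile the mapping into an execution workflow and
        # store that workflow in the Model repository.
        workflow = {
            "mapping": name,
            "engine": "spark",
            "stages": list(metadata["transformations"]),
        }
        self.repository.workflows[name] = workflow
        # Step 5: push the workflow to the cluster through the
        # Workflow Executor Service.
        self.executor.push(workflow)
        return workflow


class MappingServiceModule:
    """Hypothetical stand-in for the Mapping Service Module."""

    def __init__(self, repository, ldtm):
        self.repository = repository
        self.ldtm = ldtm
        self.queue = deque()

    def submit(self, name):
        # Step 1: receive the request and store the job in the queue.
        self.queue.append(name)

    def process_next(self):
        name = self.queue.popleft()
        # Step 2: fetch the mapping metadata from the Model repository.
        metadata = self.repository.mappings[name]
        # Step 3: pass the mapping to the LDTM.
        return self.ldtm.run(name, metadata)


# Usage: wire the components together and run one (made-up) mapping.
repository = ModelRepository(
    mappings={"m_orders": {"transformations": ["read", "filter", "write"]}}
)
executor = WorkflowExecutorService()
module = MappingServiceModule(repository, LDTM(repository, executor))
module.submit("m_orders")
workflow = module.process_next()
```

In this sketch the queue decouples request intake (step 1) from processing (steps 2 through 5), which is one plausible reading of why the Mapping Service Module queues jobs before it fetches metadata.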
For more information about the architecture of a Data Integration Service, see the "Data Integration Service Architecture" chapter in the Informatica Application Service Guide.
