Table of Contents

  1. Preface
  2. Analyst Service
  3. Catalog Service
  4. Content Management Service
  5. Data Integration Service
  6. Data Integration Service Architecture
  7. Data Integration Service Management
  8. Data Integration Service Grid
  9. Data Integration Service REST API
  10. Data Integration Service Applications
  11. Enterprise Data Preparation Service
  12. Interactive Data Preparation Service
  13. Informatica Cluster Service
  14. Mass Ingestion Service
  15. Metadata Access Service
  16. Metadata Manager Service
  17. Model Repository Service
  18. PowerCenter Integration Service
  19. PowerCenter Integration Service Architecture
  20. High Availability for the PowerCenter Integration Service
  21. PowerCenter Repository Service
  22. PowerCenter Repository Management
  23. PowerExchange Listener Service
  24. PowerExchange Logger Service
  25. SAP BW Service
  26. Search Service
  27. System Services
  28. Test Data Manager Service
  29. Test Data Warehouse Service
  30. Web Services Hub
  31. Application Service Upgrade
  32. Application Service Databases
  33. Connecting to Databases from Windows
  34. Connecting to Databases from UNIX or Linux
  35. Updating the DynamicSections Parameter of a DB2 Database

Execution Options

The following table describes the execution options for the Data Integration Service:
Property
Description
Use Operating System Profiles and Impersonation
Runs mappings, workflows, and profiling jobs with operating system profiles.
In a Hadoop environment, the Data Integration Service uses the Hadoop impersonation user to run mappings, workflows, and profiling jobs.
You can select this option if the Data Integration Service runs on UNIX or Linux. To apply changes, restart the Data Integration Service.
Launch Job Options
Runs jobs in the Data Integration Service process, in separate DTM processes on the local node, or in separate DTM processes on remote nodes. Configure the property based on whether the Data Integration Service runs on a single node or a grid and based on the types of jobs that the service runs.
Choose one of the following options:
  • In the service process.
    Configure when you run SQL data service and web service jobs on a single node or on a grid where each node has both the service and compute roles.
  • In separate local processes.
    Configure when you run mapping, profile, and workflow jobs on a single node or on a grid where each node has both the service and compute roles. Running jobs in separate local processes increases stability because an unexpected interruption to one job does not affect the other jobs.
  • In separate remote processes.
    Configure when you run mapping, profile, and workflow jobs on a grid where nodes have a different combination of roles. If you choose this option when the Data Integration Service runs on a single node, then the service runs jobs in separate local processes. You cannot run SQL data service or web service jobs in separate remote processes.
Default is in separate local processes.
If the Data Integration Service uses operating system profiles, configure the service to run jobs in separate local processes.
If the Data Integration Service runs on UNIX and is configured to run jobs in separate local or remote processes, verify that the hosts file on each node with the compute role contains a localhost entry. Otherwise, jobs that run in separate processes fail.
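A quick way to confirm the localhost entry on a compute node is to scan the hosts file. The following is a minimal Python sketch; it assumes the standard UNIX location /etc/hosts:

    # Minimal check that /etc/hosts contains a localhost entry (assumes a standard UNIX layout).
    with open("/etc/hosts") as hosts_file:
        entries = [line.split() for line in hosts_file if line.strip() and not line.startswith("#")]
    has_localhost = any("localhost" in fields[1:] for fields in entries)
    print("localhost entry found" if has_localhost else "localhost entry missing: jobs in separate processes may fail")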
Maximum On-Demand Execution Pool Size
Maximum number of on-demand jobs that can run concurrently. Jobs include data previews, profiling jobs, REST and SQL queries, web service requests, and mappings run from the Developer tool. All jobs that the Data Integration Service receives contribute to the on-demand pool size. The Data Integration Service immediately runs on-demand jobs if enough resources are available. Otherwise, the Data Integration Service rejects the job. Default is 10.
When you set this value, consider that a Developer tool client can run a maximum of 10 concurrent jobs on a Data Integration Service.
Maximum Native Batch Execution Pool Size
Maximum number of deployed jobs that can run concurrently in the native environment. The Data Integration Service moves native mapping jobs from the queue to the native job pool when enough resources are available. Default is 10.
Maximum Hadoop Batch Execution Pool Size
Maximum number of deployed jobs that can run concurrently in the Hadoop environment. The Data Integration Service moves Hadoop jobs from the queue to the Hadoop job pool when enough resources are available. Default is 100.
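The three pool sizes act as admission control: on-demand jobs run immediately or are rejected, while deployed batch jobs wait in the queue until a slot opens in the native or Hadoop pool. The following Python sketch models that difference using the default limits above; the class and method names are illustrative and not part of the product:

    from collections import deque

    class ExecutionPools:
        """Illustrative model of on-demand and batch pool behavior, not product code."""
        def __init__(self, on_demand_max=10, native_batch_max=10, hadoop_batch_max=100):
            self.limits = {"on_demand": on_demand_max, "native": native_batch_max, "hadoop": hadoop_batch_max}
            self.running = {"on_demand": 0, "native": 0, "hadoop": 0}
            self.queues = {"native": deque(), "hadoop": deque()}

        def submit_on_demand(self, job):
            # On-demand jobs run immediately if a slot is free; otherwise the service rejects them.
            if self.running["on_demand"] < self.limits["on_demand"]:
                self.running["on_demand"] += 1
                return "running"
            return "rejected"

        def submit_batch(self, job, env):
            # Deployed batch jobs are queued and move to the pool when resources are available.
            if self.running[env] < self.limits[env]:
                self.running[env] += 1
                return "running"
            self.queues[env].append(job)
            return "queued"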
Maximum Memory Size
Maximum amount of memory, in bytes, that the Data Integration Service can allocate for running all requests concurrently when the service runs jobs in the Data Integration Service process. When the Data Integration Service runs jobs in separate local or remote processes, the service ignores this value. If you do not want to limit the amount of memory the Data Integration Service can allocate, set this property to 0.
If the value is greater than 0, the Data Integration Service uses the property to calculate the maximum total memory allowed for running all requests concurrently. The Data Integration Service calculates the maximum total memory as follows:
Maximum Memory Size + Maximum Heap Size + memory required for loading program components
Default is 0.
If you run profiles or data quality mappings, set this property to 0.
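For example, with a Maximum Memory Size of 4 GB and a Maximum Heap Size of 2 GB, the limit that the service enforces can be estimated as follows. The figures are illustrative only; the memory required for loading program components varies by installation:

    # Illustrative calculation of the maximum total memory for running all requests concurrently.
    max_memory_size = 4 * 1024**3        # Maximum Memory Size property, in bytes
    max_heap_size = 2 * 1024**3          # Maximum Heap Size of the service process, in bytes
    program_components = 512 * 1024**2   # memory for loading program components (assumed value)

    max_total_memory = max_memory_size + max_heap_size + program_components
    print(f"Maximum total memory: {max_total_memory / 1024**3:.1f} GB")   # 6.5 GB in this example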
Maximum Parallelism
Maximum number of parallel threads that process a single mapping pipeline stage.
When you set the value greater than 1, the Data Integration Service enables partitioning for mappings, column profiling, and data domain discovery. The service dynamically scales the number of partitions for a mapping pipeline at run time. Increase the value based on the number of CPUs available on the nodes where jobs run.
In the Developer tool, developers can change the maximum parallelism value for each mapping. When maximum parallelism is set for both the Data Integration Service and the mapping, the Data Integration Service uses the minimum value when it runs the mapping.
You cannot change the maximum parallelism value for each profile. When the Data Integration Service converts a profile job into one or more mappings, the mappings always use Auto for the mapping maximum parallelism.
You do not have to set maximum parallelism for the Data Integration Service to use multiple partitions in the Hadoop environment.
Default is 1. Maximum is 64.
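Because the service applies the lower of the two settings, the effective thread count per pipeline stage can be reasoned about as in the following sketch. It assumes that a mapping value of Auto defers to the service setting; the function name is illustrative:

    # Effective parallelism is the lower of the service setting and the mapping setting.
    def effective_parallelism(service_max, mapping_max):
        if mapping_max == "Auto":          # assumed: Auto defers to the Data Integration Service value
            return service_max
        return min(service_max, mapping_max)

    print(effective_parallelism(8, 4))       # 4: the mapping setting caps the service setting
    print(effective_parallelism(8, "Auto"))  # 8: the mapping defers to the service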
Hadoop Kerberos Service Principal Name
Service Principal Name (SPN) of the Data Integration Service to connect to a Hadoop cluster that uses Kerberos authentication.
Not required when you run the MapR Hadoop distribution. Required for other Hadoop distributions.
Hadoop Kerberos Keytab
The file path to the Kerberos keytab file on the machine on which the Data Integration Service runs.
Not required when you run the MapR Hadoop distribution. Required for other Hadoop distributions.
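For example, hypothetical values for these two properties might look like the following. The principal and the keytab path are placeholders, not product defaults:

    # Hypothetical Kerberos settings (placeholder values only).
    hadoop_kerberos_spn = "infa_hdfs_user@EXAMPLE.COM"                  # Service Principal Name
    hadoop_kerberos_keytab = "/home/infa/security/infa_hadoop.keytab"   # keytab file on the Data Integration Service machine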
Home Directory
Root directory accessible by the node. This is the root directory for other service directories. Default is <Informatica installation directory>/tomcat/bin. If you change the default value, verify that the directory exists.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
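The following minimal Python sketch validates a candidate directory path against the disallowed characters, which apply to this and the later directory properties; the function name is illustrative:

    # Characters that are not allowed in Data Integration Service directory paths.
    FORBIDDEN = set('*?<>"|,[]')

    def is_valid_service_directory(path):
        return not (FORBIDDEN & set(path))

    print(is_valid_service_directory("/opt/informatica/dis_home"))   # True
    print(is_valid_service_directory("/opt/informatica/dis[home]"))  # False: brackets are not allowed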
Temporary Directories
Directory for temporary files created when jobs are run. Default is <home directory>/disTemp.
Enter a list of directories separated by semicolons to optimize performance during profile operations and during cache partitioning for Sorter transformations.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
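For example, a value that spreads temporary files across three disks might look like the following; the paths are hypothetical. Splitting on semicolons shows the individual directories as the service would use them:

    # Hypothetical Temporary Directories value: three directories on separate disks.
    temporary_directories = "/u01/infa/disTemp;/u02/infa/disTemp;/u03/infa/disTemp"
    for directory in temporary_directories.split(";"):
        print(directory)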
Cache Directory
Directory for index and data cache files for transformations. Default is <home directory>/cache.
Enter a list of directories separated by semicolons to increase performance during cache partitioning for Aggregator, Joiner, or Rank transformations.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
Source Directory
Directory for source flat files used in a mapping. Default is <home directory>/source.
If the Data Integration Service runs on a grid, you can use a shared directory to create one directory for source files. If you configure a different directory for each node with the compute role, ensure that the source files are consistent among all source directories.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
Target Directory
Default directory for target flat files used in a mapping. Default is <home directory>/target.
Enter a list of directories separated by semicolons to increase performance when multiple partitions write to the flat file target.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
Rejected Files Directory
Directory for reject files. Reject files contain rows that were rejected when running a mapping. Default is <home directory>/reject.
You cannot use the following characters in the directory path:
* ? < > " | , [ ]
This property change does not require a restart of the Data Integration Service.
Cluster Staging Directory
The directory on the cluster where the Data Integration Service pushes the binaries to integrate the native and non-native environments and to store temporary files during processing. Default is /tmp.
Hadoop Staging User
The HDFS user that performs operations on the Hadoop staging directory. The user needs write permission on the Hadoop staging directory. Default is the Data Integration Service user.
Custom Hadoop OS Path
The local path to the Informatica Hadoop binaries compatible with the Hadoop operating system. Required when the Hadoop cluster and the Data Integration Service are on different supported operating systems. Download and extract the Informatica binaries for the Hadoop cluster on the machine that hosts the Data Integration Service. The Data Integration Service uses the binaries in this directory to integrate the domain with the Hadoop cluster. The Data Integration Service can synchronize the following operating systems:

    SUSE 11 and Red Hat 6.5

Changes take effect after you recycle the Data Integration Service.
When you install an Informatica EBF, you must also install it in the path of the Hadoop operating system on the Data Integration Service machine.
Data Engineering Recovery
Indicates whether mapping jobs that run on the Spark engine are recovered when the Data Integration Service processing node fails. Default is False.
For more information, see the Informatica Data Engineering Administrator Guide.
State Store
The HDFS location on the cluster to store information about the state of the Spark job. Default is <Home directory>/State Store. Configure this property when you configure the run-time properties of a streaming mapping.
This property change does not require a restart of the Data Integration Service.
For more information about this property, see the Big Data Streaming User Guide.
