Preface
Analyst Service
- Analyst Service Overview
- Analyst Service Architecture
- Configuration Prerequisites
- Recycle and Disable the Analyst Service
- Properties for the Analyst Service
- Custom Images in the Analyst Tool
- Process Properties for the Analyst Service
- Creating and Configuring the Analyst Service
- Creating an Analyst Service
Content Management Service
- Content Management Service Overview
- Master Content Management Service
- Content Management Service Architecture
- Probabilistic Models and Classifier Models
- Reference Data Warehouse
  - Orphaned Reference Data
  - Deleting Orphaned Tables
- Recycling and Disabling the Content Management Service
- Content Management Service Properties
- Content Management Service Process Properties
- Creating a Content Management Service
Data Integration Service
- Data Integration Service Overview
- Before You Create the Data Integration Service
- Creating a Data Integration Service
- Data Integration Service Properties
- Data Integration Service Process Properties
- Data Integration Service Compute Properties
  - Execution Options
  - Environment Variables
- Operating System Profiles for the Data Integration Service
- High Availability for the Data Integration Service
  - Data Integration Service Restart and Failover
    - Data Integration Service Failover Configuration
  - Data Integration Service Recovery
Data Integration Service Architecture
- Data Integration Service Architecture Overview
- Data Integration Service Connectivity
- Data Integration Service Components
- Service Components
- Compute Component
- Process Where DTM Instances Run
- Single Node
- Grid
- Logs
Data Integration Service Management
- Data Integration Service Management Overview
- Enable and Disable Data Integration Services and Processes
  - Enable, Disable, or Recycle the Data Integration Service
    - Enabling, Disabling, or Recycling the Service
  - Enable or Disable a Data Integration Service Process
    - Enabling or Disabling a Service Process
- Directories for Data Integration Service Files
- Run Jobs in Separate Processes
  - DTM Process Pool Management
  - Rules and Guidelines when Jobs Run in Separate Processes
- Maintain Connection Pools
- PowerExchange Connection Pools
- Maximize Parallelism for Mappings and Profiles
- Result Set Caching
- Data Object Caching
- Persisting Virtual Data in Temporary Tables
- Content Management for the Profiling Warehouse
- Web Service Security Management
  - HTTP Client Filter
- Pass-through Security
  - Pass-Through Security with Data Object Caching
  - Adding Pass-Through Security
Data Integration Service Grid
- Data Integration Service Grid Overview
  - Grid Configuration by Job Type
- Before You Configure a Data Integration Service Grid
- Grid for Jobs that Run in the Service Process
- Grid for Jobs that Run in Local Mode
- Grid for Jobs that Run in Remote Mode
- Grid and Content Management Service
- Maximum Number of Concurrent Jobs on a Grid
- Editing a Grid
- Deleting a Grid
- Troubleshooting a Grid
Data Integration Service Applications
- Data Integration Service Applications Overview
  - Applications View
- Applications
- Logical Data Objects
- Physical Data Objects
- Mappings
- SQL Data Services
- Web Services
- Workflows
Mass Ingestion Service
- Mass Ingestion Service Overview
- Enable, Disable, or Recycle the Mass Ingestion Service
  - Enabling the Mass Ingestion Service
  - Disabling or Recycling the Mass Ingestion Service
- Mass Ingestion Service Properties
- Mass Ingestion Service Process Properties
- Creating a Mass Ingestion Service
Metadata Access Service
- Metadata Access Service Overview
- Metadata Access Service Architecture
- Metadata Access Service Properties
- Metadata Access Service Process Properties
- High Availability for the Metadata Access Service
  - Metadata Access Service Restart and Failover
- Enable and Disable Metadata Access Services and Processes
  - Enable Disable or Recycle the Metadata Access Service
    - Enabling, Disabling, or Recycling the Service
  - Enable or Disable a Metadata Access Service Process
    - Enabling or Disabling a Service Process
- Creating a Metadata Access Service
- Logs
Metadata Manager Service
- Metadata Manager Service Overview
- Configuring a Metadata Manager Service
- Creating a Metadata Manager Service
- Creating and Deleting Repository Content
- Enabling and Disabling the Metadata Manager Service
- Metadata Manager Service Properties
- Configuring the Associated PowerCenter Integration Service
  - Privileges for the Associated PowerCenter Integration Service User
Model Repository Service
- Model Repository Service Overview
- Monitoring Model Repository
- Model Repository Architecture
- Model Repository Database Requirements
- Enable and Disable Model Repository Services and Processes
  - Enable, Disable, or Recycle the Model Repository Service
    - Enabling, Disabling, or Recycling the Service
  - Enable or Disable a Model Repository Service Process
    - Enabling or Disabling a Service Process
- Properties for the Model Repository Service
- Properties for the Model Repository Service Process
  - Node Properties for the Model Repository Service Process
- High Availability for the Model Repository Service
  - Model Repository Service Restart and Failover
- Model Repository Service Management
- Version Control for the Model Repository Service
- Repository Object Administration
  - Objects View
  - Locked Object Administration
- Creating a Model Repository Service
- Configuring Monitoring Model Repository Service
PowerCenter Integration Service
- PowerCenter Integration Service Overview
- Creating a PowerCenter Integration Service
- Enabling and Disabling PowerCenter Integration Services and Processes
  - Enabling or Disabling a PowerCenter Integration Service Process
  - Enabling or Disabling the PowerCenter Integration Service
- Operating Mode
- PowerCenter Integration Service Properties
- Operating System Profiles for the PowerCenter Integration Service
- Associated Repository for the PowerCenter Integration Service
- PowerCenter Integration Service Processes
- Configuration for the PowerCenter Integration Service Grid
- Load Balancer for the PowerCenter Integration Service
PowerCenter Integration Service Architecture
- PowerCenter Integration Service Architecture Overview
- PowerCenter Integration Service Connectivity
- PowerCenter Integration Service Process
- Load Balancer
- Data Transformation Manager (DTM) Process
- Processing Threads
  - Thread Types
  - Pipeline Partitioning
- DTM Processing
- Grids
  - Workflow on a Grid
  - Session on a Grid
- System Resources
- Code Pages and Data Movement Modes
  - ASCII Data Movement Mode
  - Unicode Data Movement Mode
- Output Files and Caches
High Availability for the PowerCenter Integration Service
- High Availability for the PowerCenter Integration Service Overview
- Resilience
  - PowerCenter Integration Service Client Resilience
  - External Component Resilience
- Restart and Failover
- Recovery
- PowerCenter Integration Service Failover and Recovery Configuration
PowerCenter Repository Service
- PowerCenter Repository Service Overview
- Creating a Database for the PowerCenter Repository
- Creating the PowerCenter Repository Service
- PowerCenter Repository Service Properties
- PowerCenter Repository Service Process Properties
  - Custom Properties for the PowerCenter Repository Service Process
  - Environment Variables
- High Availability for the PowerCenter Repository Service
PowerCenter Repository Management
- PowerCenter Repository Management Overview
- PowerCenter Repository Service and Service Processes
  - Enabling and Disabling a PowerCenter Repository Service
    - Enabling a PowerCenter Repository Service
    - Disabling a PowerCenter Repository Service
  - Enabling and Disabling PowerCenter Repository Service Processes
    - Enabling a PowerCenter Repository Service Process
    - Disabling a PowerCenter Repository Service Process
- Operating Mode
  - Running a PowerCenter Repository Service in Exclusive Mode
  - Running a PowerCenter Repository Service in Normal Mode
- PowerCenter Repository Content
- Enabling Version Control
- Managing a Repository Domain
- Managing User Connections and Locks
- Sending Repository Notifications
- Backing Up and Restoring the PowerCenter Repository
- Copying Content from Another Repository
- Repository Plug-in Registration
  - Registering a Repository Plug-in
  - Unregistering a Repository Plug-in
- Audit Trails
- Repository Performance Tuning
  - Repository Statistics
  - Repository Copy, Back Up, and Restore Processes
PowerExchange Listener Service
- PowerExchange Listener Service Overview
- DBMOVER Statements for the Listener Service
- Creating a Listener Service
- Listener Service Properties
- Editing Listener Service Properties
  - Editing Listener Service General Properties
  - Editing Listener Service Configuration Properties
- Enabling, Disabling, and Restarting the Listener Service
- Listener Service Logs
- Listener Service Restart and Failover
PowerExchange Logger Service
- PowerExchange Logger Service Overview
- Configuration Statements for the Logger Service
- Creating a Logger Service
- Properties of the PowerExchange Logger Service
  - PowerExchange Logger Service General Properties
  - PowerExchange Logger Service Configuration Properties
- Logger Service Management
- Enabling, Disabling, and Restarting the Logger Service
- Logger Service Logs
- Logger Service Restart and Failover
SAP BW Service
- SAP BW Service Overview
- Creating the SAP BW Service
- Enabling and Disabling the SAP BW Service
  - Enabling the SAP BW Service
  - Disabling the SAP BW Service
- Configuring the SAP BW Service Properties
  - General Properties
  - SAP BW Service Properties
- Configuring the Associated Integration Service
- Configuring the SAP BW Service Processes
- Load Balancing for the SAP BW System and the SAP BW Service
- Viewing Log Events
Search Service
- Search Service Overview
- Search Service Architecture
- Search Index
  - Extraction Interval
- Search Request Process
- Search Service Properties
- Search Service Process Properties
- Creating a Search Service
- Enabling the Search Service
- Recycling and Disabling the Search Service
System Services
- System Services Overview
- Email Service
- Resource Manager Service
- Scheduler Service
Test Data Manager Service
- Test Data Manager Service Overview
- Test Data Manager Service Dependencies
- Test Data Manager Service Properties
- Database Connection Strings
- Configuring the Test Data Manager Service
- Creating the Test Data Manager Service
- Enabling and Disabling the Test Data Manager Service
- Editing the Test Data Manager Service
- Deleting the Test Data Manager Service
Test Data Warehouse Service
- Test Data Warehouse Service Overview
- Test Data Warehouse Services Dependencies
- Test Data Warehouse Service Properties
- Creating the Test Data Warehouse Service
- Process Properties for the Test Data Warehouse Service
Web Services Hub
- Web Services Hub Overview
- Creating a Web Services Hub
- Enabling and Disabling the Web Services Hub
- Web Services Hub Properties
- Configuring the Associated Repository
  - Adding an Associated Repository
  - Editing an Associated Repository
Application Service Upgrade
- Application Service Upgrade Overview
  - Privileges to Upgrade Services
  - Service Upgrade from Previous Versions
- Running the Service Upgrade Wizard
- Verify the Model Repository Service Upgrade
  - Object Dependency Graph
  - Maximum Heap Size
Application Service Databases
- Application Service Databases Overview
- Set Up Database User Accounts
- Data Object Cache Database Requirements
- Exception Management Audit Database Requirements
- Metadata Manager Repository Database Requirements
- Model Repository Database Requirements
- PowerCenter Repository Database Requirements
- Profiling Warehouse Requirements
- Reference Data Warehouse Requirements
- Workflow Database Requirements
- Configure Native Connectivity on Service Machines
  - Install Database Client Software
  - Configure Database Client Environment Variables on Linux
Connecting to Databases from Windows
- Connecting to Databases from Windows Overview
- Connecting to an IBM DB2 Universal Database from Windows
  - Configuring Native Connectivity
- Connecting to an Informix Database from Windows
  - Configuring ODBC Connectivity
- Connecting to Microsoft Access and Microsoft Excel from Windows
  - Configuring ODBC Connectivity
- Connecting to a Microsoft SQL Server Database from Windows
  - Configuring Native Connectivity
    - Rules and Guidelines for Microsoft SQL Server
  - Configuring Custom Properties for Microsoft SQL Server
- Connecting to a Netezza Database from Windows
  - Configuring ODBC Connectivity
- Connecting to an Oracle Database from Windows
  - Configuring Native Connectivity
- Connecting to a Sybase ASE Database from Windows
  - Configuring Native Connectivity
- Connecting to a Teradata Database from Windows
  - Configuring ODBC Connectivity
Connecting to Databases from UNIX
- Connecting to Databases from UNIX Overview
- Connecting to an IBM DB2 Universal Database from UNIX
  - Configuring Native Connectivity
- Connecting to an Informix Database from UNIX
  - Configuring ODBC Connectivity
- Connecting to a Microsoft SQL Server Database from UNIX
- Connecting to a Netezza Database from UNIX
  - Configuring ODBC Connectivity
- Connecting to an Oracle Database from UNIX
  - Configuring Native Connectivity
- Connecting to a Sybase ASE Database from UNIX
  - Configuring Native Connectivity
- Connecting to a Teradata Database from UNIX
  - Configuring ODBC Connectivity
- Connecting to an ODBC Data Source
- Sample odbc.ini File
Updating the DynamicSections Parameter of a DB2 Database
- DynamicSections Parameter Overview
- Setting the DynamicSections Parameter
  - Downloading and Installng the DDconnect JDBC Utility
  - Running the Test for JDBC Tool

Application Service Guide

10.2.1
- 10.5.7
- 10.5.6
- 10.5.5.1
- 10.5.4
- 10.5.3
- 10.5.2
- 10.5.1
- 10.5
- 10.4.1
- 10.4.0
- 10.2.2 HotFix 1
- 10.2.2 Service Pack 1
- 10.2.2

Back Next

Maximize Parallelism for Mappings and Profiles

If you have the partitioning option, you can enable the Data Integration Service to maximize parallelism when it runs mappings, runs column profiles, or performs data domain discovery. When you maximize parallelism, the Data Integration Service dynamically divides the underlying data into partitions and processes all of the partitions concurrently.

When you run a profile job, the Data Integration Service converts the profile job into one or more mappings, and then can run those mappings in multiple partitions.

If mappings process large data sets or contain transformations that perform complicated calculations, the mappings can take a long time to process and can cause low data throughput. When you enable partitioning for these mappings, the Data Integration Service uses additional threads to process the mapping. Increasing the number of processing threads increases the load on the node where the mapping runs. If the node contains sufficient CPU bandwidth, concurrently processing rows of data in a mapping can optimize mapping performance.

By default, the

Maximum Parallelism

property is set to 1 for the Data Integration Service. When the Data Integration Service runs a mapping, it separates the mapping into pipeline stages and uses one thread to process each stage. These threads are allocated to reading, transforming, and writing tasks, and they run in parallel.

When you increase the maximum parallelism value, you enable partitioning. The Data Integration Service uses multiple threads to process each pipeline stage.

The Data Integration Service can create partitions for mappings that have physical data as input and output. The Data Integration Service can use multiple partitions to complete the following actions during a mapping run:

Read from flat file, IBM DB2 for LUW, or Oracle sources.

Run transformations.

Write to flat file, IBM DB2 for LUW, or Oracle targets.

Data Integration Service Management

One Thread for Each Pipeline Stage

Multiple Threads for Each Pipeline Stage

Maximum Parallelism Guidelines

Enabling Partitioning for Mappings and Profiles

Optimize Cache and Target Directories for Partitioning

Download Guide

Watch

Comments