You can monitor statistics and view log events for a mapping job in the Monitoring tab of the Administrator tool. You can also monitor mapping jobs for the Blaze engine in the Blaze Job Monitor web application. Monitor mapping jobs for the Spark engine in the YARN web user interface.
The following image shows the Monitor tab in the Administrator tool:
The Monitor tab has the following views:
Summary Statistics
Use the
Summary Statistics
view to view graphical summaries of object states and distribution across the Data Integration Services. You can also view graphs of the memory and CPU that the Data Integration Services used to run the objects.
Execution Statistics
Use the
Execution Statistics
view to monitor properties, run-time statistics, and run-time reports. In the Navigator, you can expand a Data Integration Service to monitor
Ad Hoc Jobs
or expand an application to monitor deployed mapping jobs.
When you select
Ad Hoc Jobs
or deployed mapping jobs from an application in the Navigator of the
Execution Statistics
view, a list of jobs appears in the contents panel. The contents panel groups related jobs based on the job type. You can expand a job type to view the related jobs under it.
Access the following views the
Execution Statistics
view:
Properties
The
Properties
view shows the general properties about the selected job such as name, job type, user who started the job, and start time of the job. If you ran the mapping with the Blaze engine, you can also monitor jobs on the Hadoop cluster from the Monitoring URL that appears for the mapping in the general properties. The Monitoring URL opens the Blaze Job Monitor in a web page. The Blaze Job Monitor displays detailed monitoring statistics for a mapping such as the number of grid tasks, grid segments, or tasklets, and recovery attempts for each tasklet.
Blaze Execution Plan
The
Blaze Execution Plan
view appears when you view a mapping with the Blaze engine in the Hadoop environment and you click
Blaze Execution Plan
. The Blaze execution plan displays the Blaze engine script that the Data Integration Service generates based on the mapping logic. The execution plan includes the tasks that the script depends on. Each script has a unique identifier.
Spark Execution Plan
The
Spark Execution Plan
view appears when you view a mapping with the Spark engine in the Hadoop environment and you click
Spark Execution Plan
. When you view an execution plan for the mapping, the Data Integration Service translates the mapping to a Scala program and an optional set of commands. The execution plan shows the commands and the Scala program code.
Hive Execution Plan
The
Hive Execution Plan
view appears when you view a mapping with the Hive engine in the Hadoop environment and you click
Hive Execution Plan
. The Hive execution plan displays the Hive script that the Data Integration Service generates based on the mapping logic. The execution plan includes the Hive queries and Hive commands. Each script has a unique identifier.
Summary Statistics
The
Summary Statistics
view appears in the details panel when you select a mapping job in the contents panel. The
Summary Statistics
view displays throughput and resource usage statistics for the job run.
Summary statistics are not available for a mapping that runs on the Spark engine.
Detailed Statistics
The
Detailed Statistics
view appears in the details panel when you select a mapping job in the contents panel. The
Detailed Statistics
view displays graphs of the throughput and resource usage statistics for the job run.
Detailed statistics are not available for a mapping that runs on the Spark engine.