Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Big Data Management
  3. Connections
  4. Mappings in the Hadoop Environment
  5. Mapping Objects in the Hadoop Environment
  6. Monitoring Mappings in the Hadoop Environment
  7. Mappings in the Native Environment
  8. Profiles
  9. Native Environment Optimization
  10. Data Type Reference
  11. Function Reference
  12. Parameter Reference

Hive Engine Monitoring

Hive Engine Monitoring

You can monitor statistics and view log events for a Hive engine mapping job in the Monitor tab of the Administrator tool.
The following image shows the Hive Monitor tab in the Administrator tool:
The Monitor tab has the following views:

Summary Statistics

Use the
Summary Statistics
view to view graphical summaries of object states and distribution across the Data Integration Services. You can also view graphs of the memory and CPU that the Data Integration Services used to run the objects.

Execution Statistics

Use the
Execution Statistics
view to monitor properties, run-time statistics, and run-time reports. In the Navigator, you can expand a Data Integration Service to monitor
Ad Hoc Jobs
or expand an application to monitor deployed mapping jobs or workflows
When you select
Ad Hoc Jobs
, deployed mapping jobs, or workflows from an application in the Navigator of the
Execution Statistics
view, a list of jobs appears in the contents panel. The contents panel groups related jobs based on the job type. You can expand a job type to view the related jobs under it.
Access the following views the
Execution Statistics
view:
Properties
The
Properties
view shows the general properties about the selected job such as name, job type, user who started the job, and start time of the job.
Hive Execution Plan
The Hive execution plan displays the Hive script that the Data Integration Service generates based on the mapping logic. The execution plan includes the Hive queries and Hive commands. Each script has a unique identifier.
Summary Statistics
The
Summary Statistics
view appears in the details panel when you select a mapping job in the contents panel. The
Summary Statistics
view displays throughput and resource usage statistics for the job.
You can view the following throughput statistics for the job:
  • Source. The name of the mapping source file.
  • Target name. The name of the target file.
  • Rows. The number of rows read for source and target. If the target is Hive, this is the only summary statistic available.
  • Average Rows/Sec. Average number of rows read per second for source and target.
  • Bytes. Number of bytes read for source and target.
  • Average Bytes/Sec. Average number of bytes read per second for source and target.
  • First Row Accessed. The date and time when the Data Integration Service started reading the first row in the source file.
  • Dropped rows. Number of source rows that the Data Integration Service did not read.
If you select a Hive mapping in the contents panel, a row called "AllHiveSourceTables" appears in the Summary Statistics view. This row displays the number of rows processed across all sources.
Detailed Statistics
The
Detailed Statistics
view appears in the details panel when you select a mapping job in the contents panel. The
Detailed Statistics
view displays graphs of the throughput and resource usage statistics for the job run.


Updated November 09, 2018