To reduce the number of entities extracted from Cloudera Navigator, Metadata Manager filters Hive and Impala query templates. By default, Metadata Manager extracts only the query templates that create new entities or write to existing entities. Metadata Manager does not extract query templates that write data to unknown entities.
When you load a Cloudera Navigator resource, Metadata Manager extracts all Hive and Impala query templates that create new entities or insert data into existing entities.
For example, a Hive query template contains the following query:
INSERT OVERWRITE TABLE sales_sorted SELECT s_neighbor,price from sales ORDER BY price
This query writes data to the sales_sorted Hive table. Therefore, Metadata Manager extracts this Hive query template.
By default, Metadata Manager does not extract Hive or Impala query templates that read data but do not write data. Metadata Manager also does not extract query templates that write data to unknown entities. In Cloudera Navigator, unknown entities are marked with the ellipsis (
...
) icon.
For example, a Hive query template contains the following query:
SELECT * FROM cust WHERE state = 'CA'
This query reads data from the cust Hive table, but it does not write data to any downstream table. Therefore, Metadata Manager does not extract this Hive query template.
You can disable the filtering of Hive and Impala query templates. If you want Metadata Manager to extract all Hive and Impala query templates, disable the
Enable Filtering
option when you create or edit a Cloudera Navigator resource.