Metadata Manager Administrator Guide

Search Query Examples

Update the default search query to exclude specific entities from the metadata load.

Example 1: Excluding HDFS Entities in a Specific Directory

Your Cloudera distribution contains a temporary user named "test." When you view the HDFS directory /user/test in Cloudera Navigator, you see that all of the files owned by the test user are written to the directory /user/test/.Trash. Therefore, you do not want Metadata Manager to extract HDFS entities in the /user/test directory or its subdirectories.
To prevent Metadata Manager from extracting the entities, append the /user/test file path to the search query as follows:
NOT ((fileSystemPath:*\/.cloudera_manager_hive_metastore_canary*) OR (fileSystemPath:\/hbase\/oldWALs*) OR (fileSystemPath:\/hbase\/WALs*) OR (fileSystemPath:\/tmp\/logs*) OR (fileSystemPath:\/user\/history\/done*) OR (fileSystemPath:\/tmp\/hive-cloudera*) OR (fileSystemPath:\/tmp\/hive-hive*) OR (fileSystemPath:*\/.Trash*) OR (fileSystemPath:*\/user\/test*))
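
If you maintain the query in a script rather than editing it by hand, the append step above can be sketched in Python. This is only an illustration: the names DEFAULT_CLAUSES, path_clause, and build_query are hypothetical helpers and are not part of Metadata Manager or Cloudera Navigator. The sketch assumes that each forward slash in a path is escaped with a backslash, as in the query above.

```python
# Default exclusion clauses, copied from the search query in Example 1.
DEFAULT_CLAUSES = [
    r"(fileSystemPath:*\/.cloudera_manager_hive_metastore_canary*)",
    r"(fileSystemPath:\/hbase\/oldWALs*)",
    r"(fileSystemPath:\/hbase\/WALs*)",
    r"(fileSystemPath:\/tmp\/logs*)",
    r"(fileSystemPath:\/user\/history\/done*)",
    r"(fileSystemPath:\/tmp\/hive-cloudera*)",
    r"(fileSystemPath:\/tmp\/hive-hive*)",
    r"(fileSystemPath:*\/.Trash*)",
]


def path_clause(path):
    """Build a fileSystemPath clause that matches `path` anywhere in a file path."""
    escaped = path.replace("/", r"\/")  # assumption: every slash is backslash-escaped
    return f"(fileSystemPath:*{escaped}*)"


def build_query(extra_paths):
    """Join the default clauses and the extra path clauses into one NOT (...) group."""
    clauses = DEFAULT_CLAUSES + [path_clause(p) for p in extra_paths]
    return "NOT (" + " OR ".join(clauses) + ")"


query = build_query(["/user/test"])
print(query)
```

Because the generated clause has a wildcard on both sides, it also matches sibling paths such as /user/test2. Review the resulting query before you paste it into the resource configuration.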

Example 2: Excluding Job Executions

To prevent Metadata Manager from loading YARN, Oozie, and MapReduce job executions and all Sqoop job templates and executions, update the default search query as follows:
NOT ((fileSystemPath:*\/.cloudera_manager_hive_metastore_canary*) OR (fileSystemPath:\/hbase\/oldWALs*) OR (fileSystemPath:\/hbase\/WALs*) OR (fileSystemPath:\/tmp\/logs*) OR (fileSystemPath:\/user\/history\/done*) OR (fileSystemPath:\/tmp\/hive-cloudera*) OR (fileSystemPath:\/tmp\/hive-hive*) OR (fileSystemPath:*\/.Trash*))
AND NOT (((sourceType:YARN OR sourceType:OOZIE OR sourceType:MAPREDUCE) AND type:OPERATION_EXECUTION) OR sourceType:SQOOP)
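
The second clause in this query can be assembled the same way. The following Python sketch is illustrative only: job_exclusion_clause is a hypothetical helper, and the only source types and entity type it uses are the ones that appear in the query above.

```python
def job_exclusion_clause(execution_sources, excluded_sources):
    """Build the AND NOT (...) clause for job executions.

    execution_sources: source types for which only OPERATION_EXECUTION
                       entities are excluded.
    excluded_sources:  source types that are excluded entirely.
    """
    source_terms = " OR ".join(f"sourceType:{s}" for s in execution_sources)
    executions = f"(({source_terms}) AND type:OPERATION_EXECUTION)"
    entire_sources = [f"sourceType:{s}" for s in excluded_sources]
    return "AND NOT (" + " OR ".join([executions] + entire_sources) + ")"


clause = job_exclusion_clause(["YARN", "OOZIE", "MAPREDUCE"], ["SQOOP"])
print(clause)
```

Append the printed clause to the default NOT (...) group, separated by a space or a line break, to get the full query shown in this example.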
