Use the Hive enabler job to link a Hive table to the preferred records table in the repository. If you link a Hive table to the preferred records table in the repository, the data continues to persist in the repository, and the Hive table accesses data from the repository. You do not have to run the Hive enabler job again for the incremental data.
The following image shows how the Hive enabler job links the Hive table to the repository table:
In the configuration file, if you set
StoreAllFields
to false, the repository does not persist all the columns but persists only the columns that you use to index the input data. If you want to view all the columns in Hive, ensure that you set
StoreAllFields
to true in the configuration file when you link the input data.