You can configure a Lookup transformation to cache a relational or flat file lookup source. Enable lookup caching on a large lookup table or file to increase lookup performance.
The Integration Service builds a cache in memory when it processes the first row of data in a cached Lookup transformation. The Integration Service creates caches as the source rows enter the Lookup transformation. It allocates memory for the cache based on the amount you configure in the transformation. The Integration Service stores condition values in the index cache and output values in the data cache. The Integration Service queries the cache for each row that enters the transformation.
If the data does not fit in the memory cache, the Integration Service stores the overflow values in cache files. The Integration Service creates the cache files in the cache directory. By default, the Integration Service creates cache files in the directory specified in the CacheDir system parameter. When the mapping completes, the Integration Service releases cache memory and deletes the cache files unless you configure the Lookup transformation to use a persistent cache.
If you use a flat file lookup, the Integration Service caches the lookup source. If you configure a flat file lookup for sorted input, the Integration Service cannot cache the lookup if the condition columns are not grouped. If the columns are grouped, but not sorted, the Integration Service processes the lookup as if you did not configure sorted input.
When you do not configure the Lookup transformation for caching, the Integration Service queries the lookup source for each input row. Whether or not you cache the lookup source, the result of the Lookup query and processing is the same. However, you can increase lookup performance on a large lookup source if you enable lookup caching.