The Model Repository Service reads probabilistic model and classifier model file data from the machine that hosts the master Content Management Service in the domain. When you compile a probabilistic model or classifier model in the Developer tool, you update the model files on the master Content Management Service machine.
If a node in the domain runs a Content Management Service, the node stores local copies of the probabilistic model and classifier model files. You specify the local path to the probabilistic and classifier model files in the NLP Options property on the Content Management Service. The master Content Management Service synchronizes the probabilistic model and classifier model files on the domain nodes with the master Content Management Service files every 10 minutes.
To synchronize a Content Management Service machine with the current files from the master Content Management Service machine, run the following command:
infacmd cms ResyncData
When you add a Content Management Service to a domain that includes a master Content Management Service, run the ResyncData command. The command updates the machine that hosts the new service with the probabilistic model or classifier model files from the master Content Management Service machine.
You specify a single type of model file when you run the command. To synchronize probabilistic model files and classifier model files, run the command once for each type of model file.
Synchronization Operations
The master Content Management Service stores a list of the Content Management Services in the domain. When the master Content Management Service synchronizes with the domain services, the master Content Management Service copies the current model files sequentially to each domain node. If a node is unavailable, the master Content Management Service moves the node to the end of the list and synchronizes with the next node on the list. After the synchronization operation copies the files to all available Content Management Service machines, the operation ends.
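The following Python sketch models the synchronization order described above. It is an illustration only and not Informatica code: the function name synchronize, the is_available and copy_files callbacks, and the node names are hypothetical. The sketch also assumes that a node moved to the end of the list is tried one more time before the operation ends, which the description above does not state explicitly.

```python
from collections import deque
from typing import Callable, Iterable, List


def synchronize(nodes: Iterable[str],
                is_available: Callable[[str], bool],
                copy_files: Callable[[str], None]) -> List[str]:
    """Run one synchronization pass and return the nodes that were skipped."""
    pending = deque(nodes)      # list of Content Management Service nodes
    deferred = set()            # nodes already moved to the end of the list
    skipped = []
    while pending:
        node = pending.popleft()
        if is_available(node):
            copy_files(node)            # copy the current model files
        elif node not in deferred:
            deferred.add(node)
            pending.append(node)        # move the node to the end of the list
        else:
            skipped.append(node)        # assumption: give up after one retry
    return skipped


# Hypothetical domain with one unavailable node.
status = {"node_a": True, "node_b": False, "node_c": True}
copied: List[str] = []
skipped = synchronize(status, status.get, copied.append)
print("copied:", copied, "skipped:", skipped)
```

In this model, the nodes are processed sequentially, and the operation ends after every available node receives the files.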
To verify that a synchronization operation succeeded on a node, browse the directory structure on the node and find the probabilistic or classifier model files. Compare the files with the files on the master Content Management Service machine.
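The comparison can also be scripted. The following Python sketch is one possible approach, assuming that both model file directories are readable from the machine where the script runs, for example through a mounted share or a local copy. The function names and the result keys are hypothetical, and you supply the directory paths yourself.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def file_digests(root: Path) -> Dict[str, str]:
    """Map each file path, relative to root, to its SHA-256 digest."""
    digests = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            name = path.relative_to(root).as_posix()
            digests[name] = hashlib.sha256(path.read_bytes()).hexdigest()
    return digests


def compare_model_dirs(master_dir: Path, node_dir: Path) -> Dict[str, List[str]]:
    """Report files that differ between the master copy and the node copy."""
    master = file_digests(master_dir)
    node = file_digests(node_dir)
    shared = master.keys() & node.keys()
    return {
        "missing_on_node": sorted(master.keys() - node.keys()),
        "extra_on_node": sorted(node.keys() - master.keys()),
        "different": sorted(n for n in shared if master[n] != node[n]),
    }
```

If all three lists in the result are empty, the node directory matches the master directory.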
Informatica uses the following directory paths as the default locations for the files:
The time required to synchronize the model files depends on the number of files on the master Content Management Service machine. The ResyncData command copies model files in batches of 15 files.
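To estimate how many batches a synchronization needs, divide the number of model files by 15 and round up. The following Python sketch shows the arithmetic. The function names and file names are hypothetical, and the sketch does not describe how the command schedules or times each batch.

```python
import math
from typing import Iterator, List, Sequence

BATCH_SIZE = 15  # batch size stated for the ResyncData command


def batches(files: Sequence[str], size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `size` files."""
    for start in range(0, len(files), size):
        yield list(files[start:start + size])


def batch_count(file_count: int, size: int = BATCH_SIZE) -> int:
    """Return the number of batches needed to copy file_count files."""
    return math.ceil(file_count / size)


# Hypothetical set of 40 model files.
model_files = [f"model_{i}.bin" for i in range(40)]
print(batch_count(len(model_files)))
print([len(group) for group in batches(model_files)])
```

For example, 40 model files require three batches that contain 15, 15, and 10 files.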