To install address reference data files on the DataNodes in a Hadoop cluster, run the
copyRefDataToComputeNodes.sh
shell script. Or, define a job to run the shell script in a job scheduler application at time intervals that you specify.
Before you run the script or define the job, review the option values that you specify for the script. You can accept the default values or update the values.
Installing the Address Reference Data Files at the Command Prompt
To install the files at the command prompt, perform the following steps:
At the command prompt, open the following directory:
<Informatica installation directory>/tools/dq/av
Run
copyRefDataToComputeNodes.sh
.
Optionally, enter one or more values for the script options. If you do not enter a value for an option, the script runs with the default value for the option.
By default, the script prompts you to confirm the installation of the files. To install the files, enter Y.
Installing the Address Reference Data Files with a Scheduled Job
You can define a job to run the shell script at time intervals that you specify. Add the job to a job scheduler application. If you define a job to install the files, you must disable the prompt to confirm installation.
To disable the prompt, set the following option on the shell script: