Effective in version 10.1.1, Informatica Big Data Management includes a shell script that installs address reference data files on the compute nodes that you specify.
When you run an address validation mapping in a Hadoop environment, the reference data files must reside on each compute node on which the mapping runs. Use the script to install the reference data files on multiple nodes in a single operation.
The shell script name is copyRefDataToComputeNodes.sh.
Find the script in the following directory in the Informatica Big Data Management installation:
[Informatica installation directory]/tools/dq/av
When you run the script, you can enter the following information:
The current location of the reference data files.
The directory to which the script installs the files.
The location of the file that contains the compute node names.
The user name of the user who runs the script.
If you omit any of this information, the script uses default values for the file locations and the user name.
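The options and internals of copyRefDataToComputeNodes.sh are not described here, so the following is only a minimal sketch of the kind of operation such a script performs, not the script itself. It assumes that the node list is a plain-text file with one host name per line and that the files are copied with rsync over SSH. The function name print_copy_plan, its argument order, and the example paths are all hypothetical. The function prints one copy command per node so that you can review the plan before anything runs.

```shell
#!/bin/sh
# Illustrative sketch only: this is NOT copyRefDataToComputeNodes.sh.
# The function name, argument order, and node-file format are assumptions.

# print_copy_plan SRC_DIR DEST_DIR NODES_FILE REMOTE_USER
# Prints one rsync command for each compute node listed in NODES_FILE.
# Assumes one host name per line; blank lines and lines that start
# with '#' are skipped. Paths are assumed to contain no whitespace.
print_copy_plan() {
  src_dir=$1
  dest_dir=$2
  nodes_file=$3
  remote_user=$4

  # The "|| [ -n ... ]" part also handles a last line without a newline.
  while IFS= read -r node || [ -n "$node" ]; do
    case "$node" in
      ''|'#'*) continue ;;   # skip blank lines and comments
    esac
    # Trailing slashes copy the contents of SRC_DIR into DEST_DIR.
    printf 'rsync -a %s/ %s@%s:%s/\n' \
      "$src_dir" "$remote_user" "$node" "$dest_dir"
  done < "$nodes_file"
}
```

For example, print_copy_plan /data/av_ref /opt/av_ref nodes.txt hadoop prints one rsync command for each host in nodes.txt. The four arguments mirror the four inputs listed above: the source location, the target directory, the node list file, and the user name.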
For more information, see the Informatica Big Data Management 10.1.1 Installation and Configuration Guide.