, you must create a configuration file. A configuration file includes parameters related to the Hadoop distribution, repository, simple matching rules, input record layout, and metadata.
You can create a matching rules file based on your requirement. A matching rules file includes parameters related to advanced matching rules. The behavior of the batch jobs depends on the parameters that you configure in the configuration file and the matching rules file.
If you plan to consolidate the linked data, you must create a consolidation rules file. The consolidation rules file contains the row, column, and default rules that the consolidation process uses to create a preferred record for each cluster.
If you plan to create relationships between the processed data, you must create a relationship configuration file. A relationship configuration file defines the business entity types and their potential relationships. If you plan to view the relationship graph that you create, you must create a properties file for the relationship graph user interface.