Cluster and Categorize Column Data
Effective in version 10.2.2, you can cluster similar values in a column, and then categorize the values based on recommendations from Enterprise Data Lake. The application uses a phonetic algorithm to cluster similar values, and then suggests that you replace the less frequently occurring values with the most frequently occurring value.
For more information, see the "Prepare Data" chapter in the
Informatica 10.2.2 Enterprise Data Lake User Guide.