Hi, I'm Ask INFA!
What would you like to know?
ASK INFAPreview
Please to access Ask INFA.

Table of Contents

Search

  1. Preface
  2. Working with Transformations
  3. Aggregator Transformation
  4. Classifier Transformation
  5. Cleanse transformation
  6. Comparison Transformation
  7. Custom Transformation
  8. Custom Transformation Functions
  9. Consolidation Transformation
  10. Data Masking Transformation
  11. Data Masking Examples
  12. Decision Transformation
  13. Dynamic Lookup Cache
  14. Expression Transformation
  15. External Procedure Transformation
  16. Filter Transformation
  17. HTTP Transformation
  18. Identity Resolution Transformation
  19. Java Transformation
  20. Java Transformation API Reference
  21. Java Expressions
  22. Java Transformation Example
  23. Joiner Transformation
  24. Key Generator Transformation
  25. Labeler Transformation
  26. Lookup Transformation
  27. Lookup Caches
  28. Match Transformation
  29. Match Transformations in Field Analysis
  30. Match Transformations in Identity Analysis
  31. Merge Transformation
  32. Normalizer Transformation
  33. Parse transformation
  34. Rank Transformation
  35. Router Transformation
  36. Rule Specification transformation
  37. Sequence Generator Transformation
  38. Sorter Transformation
  39. Source Qualifier Transformation
  40. SQL Transformation
  41. Using the SQL Transformation in a Mapping
  42. Stored Procedure Transformation
  43. Standardizer Transformation
  44. Transaction Control Transformation
  45. Union Transformation
  46. Unstructured Data Transformation
  47. Update Strategy Transformation
  48. Verifier transformation
  49. Weighted Average Transformation
  50. XML Transformations

Transformation Guide

Transformation Guide

Hamming Distance

Hamming Distance

Use the Hamming Distance algorithm when the position of the data characters is a critical factor, for example in numeric or code fields such as telephone numbers, ZIP Codes, or product codes.
The Hamming Distance algorithm calculates a match score for two data strings by computing the number of positions in which characters differ between the data strings. For strings of different length, each additional character in the longest string is counted as a difference between the strings.

Hamming Distance Example

Consider the following strings:
  • Morlow
  • M
    a
    rlow
    es
The highlighted characters indicate the positions that the Hamming algorithm identifies as different.
To calculate the Hamming match score, the transformation divides the number of matching characters (5) by the length of the longest string (8). In this example, the strings are 62.5% similar and the match score is
0.625
.

0 COMMENTS

We’d like to hear from you!