Enrich your data with fuzzy logic
One of the special things Tale of Data makes easily available is fuzzy logic matching. You can use it to hunt for duplicates (or "very close duplicates") in your data, or join together data from different sources where the matching has to be tolerant to mistakes.
You often get mistakes if
- the data is coming from separate systems
- some data has been input by hand, maybe without much quality control at the time
- the data is not all that accurate, for example it is okay so far as it sounds when you say it, but varies in terms of how it is written
Tale of Data helps you overcome these problems by using fuzzy logic in its matching and joining operations.
In the video below we have a list of people who have purchased some cars, and we check their names (first and last names) against an offenders register in a central database. We can cope with exact matches as well as approximate ones. This kind of example can be transposed to any kind of approximate matching problem you might encounter with your data.
As a follow-up, don't miss out on an extra video about the "custom fuzzy matching" setting within the enrichment node.