How to remove duplicates using fingerprints (aka keys)

If you need to identify duplicate lines in your data, and that there is rather a lot of data to work through, or you would like to use several columns at once as criteria for detecting duplicates, this is the right place to start !

The transformation within Tale of Data is called Multi-Algorithm Deduplication.

Find out more with this video :