deduplicate-data
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends