PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Duplicates Python Packages

Python packages with the GitHub topic duplicates. Sorted by relevance, with stars and monthly downloads.
LibreTranslate
removedup

Remove duplicates from parallel corpora

4K 7 1
Hweded
uniqdiff

Stable exact comparison engine foundation for Python and the UniqTools ecosystem.

615 1 0
deplicate
deplicate

Advanced Duplicate File Finder for Python. Nothing is impossible to solve.

450 79 17
zeronyk
imageduplicatefinder

Simple duplication finder for Images, matches on names and then compares image hashes.

358 0 1
veltzer
pyunique

Pyunique helps you get rid of duplicate files

354 0 0
KeyWeeUsr
thebear

Bear - the decluttering deduplicator

242 4 1
hansalemaos
arrayhascher

Fast hash in 2D Arrays (Numpy/Pandas/lists/tuples)

178 1 0
hansalemaos
a-pandas-ex-duplicates-to-df

Creates a DataFrame/Series from duplicates

161 0 0
vuolter
deplicate-cli

Command Line Interface for deplicate.

142 3 1
hansalemaos
dropduplicatesplanb

Drops duplicates in DataFrames with tedious dtypes

140 0 0
yugn
yadupe

Yet another tool to find and remove duplicate files.

133 0 1
hansalemaos
screwduplicates

provides a simple and efficient way to remove duplicates from an iterable (even with unhashable elements, optional order preservation)

103 0 0
hansalemaos
drop-duplicates-nested-list

Drops duplicates from nested list

93 0 0
hansalemaos
duplicateindexer

Find duplicates in multiple lists and return their indices and values.

90 0 0
jmsv
listset

remove duplicates from lists

82 0 0
NicolasBi
dupe-eraser

A command-line tool which automate the deletion of duplicate files based on their hash or perceptual-hash.

75 13 0
hansalemaos
stridesduplicatefinder

Calculate overlapping values between two arrays and return the results as a DataFrame

59 0 0
dealfonso
searchdups

Search for duplicate files

35 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery