PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Near Duplicate Detection Python Packages

Python packages with the GitHub topic near-duplicate-detection. Sorted by relevance, with stars and monthly downloads.
iscc
iscc

ISCC: International Standard Content Code

449 50 8
justinbt1
akin

Python library for detecting near duplicate texts in a corpus at scale.

423 9 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery