PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Minhash Python Packages

Python packages with the GitHub topic minhash. Sorted by relevance, with stars and monthly downloads.
ekzhu
datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

6.7M 3K 317
beowolx
rensa

High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets

83K 240 21
sourmash-bio
sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

11K 542 92
justinbt1
akin

Python library for detecting near duplicate texts in a corpus at scale.

411 9 0
src-d
libmhcuda

Weighted MinHash implementation on CUDA (multi-gpu).

358 122 26
serega
gaoya

Locality Sensitive Hashing

190 80 9
lgautier
mashing-pumpkins

Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.

108 22 3
kiwirafe
xiangsi

中文文本相似度计算器

83 171 23
dnbaker
sketch-ds

C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings

53 157 14
kiwirafe
xiangshi

中文文本相似度计算器

3 171 23
    • Data from PyPI, GitHub, ClickHouse, and BigQuery