PyRank

Simhash Python Packages

Python packages with the GitHub topic simhash. Sorted by relevance, with stars and monthly downloads.
smarthi
pymuvera

Python library for MUVERA multi-vector retrieval via Fixed Dimensional Encodings. ColBERT / ColQwen2 / ColQwen3.5 compatible.

3K 1 0
Marcnuth
deduplication

Remove duplicate documents via popular algorithms such as SimHash, SpotSig, Shingling, etc.

301 18 5
oduwsdl
otmt

Tools for determining if web archive collections are Off-Topic

243 9 3
saeeddhqan
entropy-hash

EntropyHash: near duplicate detection algorithm

203 0 0
serega
gaoya

Locality Sensitive Hashing

173 80 9
Shangri-la-0428
thronglets

Local AI substrate for agents with sparse signals, hooks, and optional adapters

147 4 1
kiwirafe
xiangsi

Chinese text similarity calculator

82 171 23
hybridtheory
floc-simhash

A fast Python implementation of the SimHash algorithm.

76 27 7
kiwirafe
xiangshi

Chinese text similarity calculator

3 171 23
    • Data from PyPI, GitHub, ClickHouse, and BigQuery