PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

String Search Python Packages

Python packages with the GitHub topic string-search. Sorted by relevance, with stars and monthly downloads.
ashvardanian
stringzilla

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

4.9M 3K 125
taleinat
fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

784K 342 27
matthewakram
token-fuzz-rs

The fastest token-based fuzzy string matching for very large, static corpora (Rust-backed, Python-first).

13K 3 0
ashvardanian
stringzillas-cpus

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

7K 3K 125
yutanagano
nearust

Fast discovery of similar strings in bulk

4K 2 1
ifplusor
actrie

Aho-Corasick automation for large-scale multi-pattern matching. Available for C/C++, Python, and Java on Linux, macOS, and Windows.

3K 14 5
yutanagano
symscan

Fast discovery of similar strings in bulk

2K 2 1
chen0040
pyalgs

Python implementation of algorithms on string handling, data structure, graph processing, etc

589 12 9
ashvardanian
stringzillas-cuda

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

585 3K 125
chuanconggao
topsim

Efficiently search the most similar strings against the query in Python.

330 18 4
jeffrimko
verace

Python library for checking string consistency between files.

314 1 0
chuanconggao
tagstats

A concise yet efficient implementation for the statistics of each tag's set of key phrases over input lines in Python.

180 1 1
mmkhattab
multimatcher

A convenient implementation of the Aho-Corasick algorithm to efficiently find multiple search patterns and process the matches

114 0 2
chuanconggao
topemoji

�� the most similar ��s.

65 1 1
lqdc
pysimstr

Fast(ish) string similarity for one vs many comparisons.

60 3 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery