PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Corpus Processing Python Packages

Python packages with the GitHub topic corpus-processing. Sorted by relevance, with stars and monthly downloads.
johentsch
ms3

A parser for annotated MuseScore 3 files.

4K 56 6
StarlangSoftware
nlptoolkit-corpus

Corpus processing library

3K 3 9
StarlangSoftware
nlptoolkit-corpus-cy

Corpus Processing Library

2K 0 0
ku-nlp
kyoto-reader

A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus

1K 10 3
Helsinki-NLP
opusfilter

Toolbox for filtering parallel corpora

889 115 26
CentreForDigitalHumanities
ianalyzer-readers

Utilities for extracting XML, HTML, CSV, XLSX, and RDF data with a common interface

440 0 0
jonathandunn
corpus-similarity

Measure the similarity of text corpora for 74 languages

337 14 3
versotym
rhymetagger

A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry

264 34 4
ringoreality
uniblock

uniblock, scoring and filtering corpus with Unicode block information (and more).

165 5 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery