PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Corpus Linguistics Python Packages

Python packages with the GitHub topic corpus-linguistics. Sorted by relevance, with stars and monthly downloads.
natasha
nerus

Large silver standart Russian corpus with NER, morphology and syntax markup

1K 74 11
craigtrim
bnc-lookup

Fast word validation using the British National Corpus.

942 0 0
fau-klue
association-measures

Statistical association measures for Python pandas

857 10 2
fatihbozdag
bitig

Next-generation computational stylometry — a Python replacement for R's Stylo

760 0 0
craigtrim
gngram-lookup

Static Hash-Based Lookup for Google Ngram Frequencies

576 0 0
mshakirDr
mfte

MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include semantic tags from Biber (2006) and Biber et al. (1999), including other specific tags.

516 30 3
IngoKl
textdirectory

TextDirectory allows you to combine multiple text files into one. While doing this, filters and transformations can be applied.

487 11 2
engisalor
sgex

A Python package for the Sketch Engine API

477 8 0
jonathandunn
corpus-similarity

Measure the similarity of text corpora for 74 languages

344 14 3
acqdiv
acqdiv

Pipeline for the ACQDIV Corpus Database

310 1 3
interrogator
conll-df

CONLL-U to Pandas DataFrame

214 31 9
rmalouf
treesearch-ud

High-performance toolkit for querying linguistic dependency parses

209 3 0
edwardseley
lyricscorpora

An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts

191 18 1
jaaack-wang
lfextractor

A corpus-linguistic tool to extract and search for linguistic features

156 1 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery