PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Language Processing Python Packages

Python packages with the GitHub topic language-processing. Sorted by relevance, with stars and monthly downloads.
kariminf
aruudy

Arabic prosody (Arud) or "Science of Poetry"

2K 5 1
MycroftAI
padatious

A neural network intent parser

1K 162 42
TheWelcomer
morphseg

A multilingual package for segmenting text into morphemes using supervised deep learning.

1K 2 0
WZBSocialScienceCenter
germalemma

A lemmatizer for German language text

749 95 12
ysenarath
sinling

A collection of NLP tools for Sinhalese (සිංහල).

706 61 20
sefineh-ai
amharic-tokenizer

Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.

440 99 14
john-hawkins
texturizer

A library and command line application for adding different kinds of features derived from columns of raw text.

292 4 1
versotym
rhymetagger

A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry

278 34 4
vane
prosecco

Simple, extendable nlp engine that can extract data based on provided conditions.

274 0 0
ishto7
persianutils

Standardize your Persian text: Preprocessing, Embedding, and more!

232 16 3
Sylhare
simple-lda

:bookmark: simple lda - latent dirichlet allocation

151 2 2
markomanninen
grcriddles

Study and examination of alphabetical and isopsephical riddles of the Ancient Greeks

139 1 0
verifid
ner-d

Python module for Named Entity Recognition (NER) using natural language processing.

132 13 3
syntpump
dclua

Library for word declension

131 5 1
ysenarath
testasasnkaonlytest

A collection of NLP tools for Sinhalese (සිංහල).

88 61 20
mapado
pynlg

``pynlg`` is a pure python re-implementation of [SimpleNLG-EnFr](https://github.com/rali-udem/SimpleNLG-EnFr), a java library enabling bilingual [text surface realisation](https://en.wikipedia.org/wiki/Realization_%28linguistics%29), based on [SimpleNLG](https://github.com/simplenlg/simplenlg).

67 29 10
gerivanc
xphrase

🛠️ XPhrase Generation is a multilingual phrase generator designed for command-line interface (CLI) usage. It creates expressive, randomized phrases using words from 🇧🇷 Portuguese, 🇬🇧 English, and 🇩🇪 German, interlinked with special characters ✨ and digits 🔢

63 1 0
thomasbrockmeier
kpss-py3

Kraaij-Pohlmann Snowball Stemmer

41 0 0
TheWelcomer
testmorphseg

An efficient and easy-to-use morpheme segmentation library

1 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery