PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Nlp Machine Learning Python Packages

Python packages with the GitHub topic nlp-machine-learning. Sorted by relevance, with stars and monthly downloads.
chrismattmann
tika

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

417K 2K 250
bjascob
lemminflect

A python module for English lemmatization and inflection.

130K 280 26
pdrm83
sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.

14K 135 12
CLARIN-PL
clarinpl-embeddings

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language

7K 37 3
deeppavlov
deeppavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

5K 7K 1K
Jasonsey
fern2

A model development structure control for NLP

4K 3 1
MilaNLProc
contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

4K 1K 152
howl-anderson
microtokenizer

A micro tokenizer for Chinese

4K 159 22
Ars-Linguistica
mlconjug3

A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

3K 80 13
StabRise
scaledp

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

3K 18 1
katanaml
sparrow-parse

Structured data extraction and instruction calling with ML, LLM and Vision LLM

3K 5K 516
StatguyUser
textfeatureselection

Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

3K 53 5
stonybrooknlp
appworld

🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.

2K 419 69
microsoft
autobrewml

With AutoBrewML Framework the time it takes to get production-ready ML models with great ease and efficiency highly accelerates.

2K 25 31
bobxwu
topmost

Topmost: A Topic Modeling System Toolkit

2K 288 27
piteren
torchness

PyTorch tools

2K 0 0
yaniv-shulman
chunkey-bert

ChunkeyBert is a minimal and easy-to-use keyword extraction technique that leverages BERT embeddings for unsupervised keyphrase extraction from long text documents.

2K 1 0
brightertiger
pygarble

Python Package to detect garbled, gibberish text for EN

1K 14 4
maximtrp
bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

1K 85 15
thunlp
openprompt

An Open-Source Framework for Prompt-Learning.

1K 5K 483
MAIF
melusine

📧 Melusine: Use python to automatize your email processing workflow

1K 361 59
NorskRegnesentral
skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

1K 926 77
ahmetozdemirrr
turkish-syllable

A Turkish syllable splitter implemented in C with Python bindings

1K 1 0
Kaleidophon
nlp-uncertainty-zoo

Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.

1K 55 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery