PyRank

Natural Language Processing Python Packages

Python packages with the GitHub topic natural-language-processing. Sorted by relevance, with stars and monthly downloads.
huggingface
huggingface-hub

The official Python client for the Hugging Face Hub.

256.1M 4K 1K
huggingface
tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

175.3M 11K 1K
huggingface
transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

149.1M 161K 33K
huggingface
datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

127.8M 22K 3K
nltk
nltk

NLTK Source

63.2M 15K 3K
google
sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

32M 12K 1K
explosion
thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

24.4M 3K 292
explosion
spacy

💫 Industrial-strength Natural Language Processing (NLP) in Python

21.6M 34K 5K
explosion
spacy-loggers

📟 Logging utilities for spaCy

18.2M 12 17
adbar
htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

11.2M 149 30
datamade
usaddress

🇺🇸 A Python library for parsing unstructured United States address strings into address components

5.7M 2K 308
Unstructured-IO
unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.

5.4M 15K 1K
RaRe-Technologies
gensim

Topic Modelling for Humans

5M 16K 4K
sloria
textblob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

2.6M 10K 1K
pemistahl
lingua-language-detector

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

1.7M 2K 60
openvinotoolkit
openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

1.4M 10K 3K
autogluon
autogluon-tabular

Fast and Accurate ML in 3 Lines of Code

1.2M 10K 1K
autogluon
autogluon-core

Fast and Accurate ML in 3 Lines of Code

1.2M 10K 1K
PyThaiNLP
pythainlp

Thai natural language processing in Python

1.1M 1K 297
autogluon
autogluon-features

Fast and Accurate ML in 3 Lines of Code

1.1M 10K 1K
JohnSnowLabs
spark-nlp

State of the Art Natural Language Processing

1.1M 4K 743
autogluon
autogluon-common

Fast and Accurate ML in 3 Lines of Code

1.1M 10K 1K
microsoft
flaml

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

960K 4K 560
stanfordnlp
stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

907K 8K 942
    • Data from PyPI, GitHub, ClickHouse, and BigQuery