PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Nlp Library Python Packages

Python packages with the GitHub topic nlp-library. Sorted by relevance, with stars and monthly downloads.
explosion
spacy

💫 Industrial-strength Natural Language Processing (NLP) in Python

21.6M 34K 5K
PyThaiNLP
pythainlp

Thai natural language processing in Python

1.1M 1K 297
chrismattmann
tika

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

417K 2K 250
Ailln
cn2an

📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

312K 760 83
mocobeta
janome

Japanese morphological analysis engine written in pure Python

268K 913 54
taishi-i
nagisa

A Japanese tokenizer based on recurrent neural networks

226K 417 23
ibm
unitxt

🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking

77K 212 67
urduhack
urduhack

An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.

74K 309 42
CAMeL-Lab
camel-tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

28K 549 89
OpenPecha
botok

🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python

24K 80 15
wannaphong
laonlp

Lao language Natural Language Processing toolkit

23K 34 6
gandersen101
spaczz

Fuzzy matching and more functionality for spaCy.

15K 259 31
VietHoang1710
khmer-nltk

Khmer natural language processing toolkit

15K 83 19
medspacy
medspacy

Library for clinical NLP with spaCy.

14K 651 110
cdpierse
breame

Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

12K 18 0
nullnull
simstring-pure

A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.

7K 125 17
proycon
pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

6K 476 66
Jasonsey
fern2

A model development structure control for NLP

4K 3 1
vikhram-s
indianconstitution

A Python library for exploring the Constitution of India.

4K 3 0
MilaNLProc
contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

4K 1K 152
vinhdq842
soe-vinorm

An effective text normalization tool for Vietnamese

4K 19 8
TakeLab
spacy-udpipe

spaCy + UDPipe

3K 168 9
MIND-LAB
octis

OCTIS: a library for Optimizing and Comparing Topic Models.

3K 800 118
Ars-Linguistica
mlconjug3

A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

3K 80 13
    • Data from PyPI, GitHub, ClickHouse, and BigQuery