PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Text Analysis Python Packages

Python packages with the GitHub topic text-analysis. Sorted by relevance, with stars and monthly downloads.
5j9
wikitextparser

A Python library to parse MediaWiki WikiText

101K 321 25
Lips7
matcher-py

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.

30K 18 1
biolab
orange3-text

🍊 :page_facing_up: Text Mining add-on for Orange3

13K 134 86
shcherbak-ai
contextgem

ContextGem: Effortless LLM extraction from documents

13K 2K 156
jboynyc
textnets

Text analysis with networks.

5K 293 23
quadrismegistus
logmap

A hierarchical, context-manager logger utility with multiprocess mapping capabilities

4K 0 0
NationalLibraryOfNorway
dhlab

DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway.

4K 26 4
convosense
convosense-utilities

Email Signature remover - Extracting email body out of the email text in order to get accurate sentiment results, using NLP tasks.

4K 22 2
johnbumgarner
wordhoard

This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.

3K 125 12
twardoch
split-markdown4gpt

A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.

2K 29 3
microsoft
autobrewml

With AutoBrewML Framework the time it takes to get production-ready ML models with great ease and efficiency highly accelerates.

2K 25 31
power-of-language
oneai

Python SDK for One AI APIs. One AI is an NLP-as-a-service platform. Our APIs enables language comprehension in context, transforming texts from any source into structured data to use in code.

2K 38 7
rosette-api
rosette-api

Babel Street Analytics Client Library for Python

2K 38 37
BlackMount-ai
blackmount-nlp-mcp

NLP without the bloat — sentiment, keywords, readability, summarization. No NLTK, no spaCy. Zero heavy dependencies.

2K 1 0
nlpie
biomedicus

A biomedical and clinical natural language processing engine.

2K 21 8
meer-khan
pattex

Regex-based pattern extraction library for Python — emails, URLs, phones, IPs, and more.

1K 0 0
welfare-state-analytics
humlab-westac

Welfare State Analytics

1K 5 0
sagnik-chakravarty
arcshiftwrap

Python client for the Arctic Shift API.

1K 1 0
zhiyzuo
python-topic-model-preprocessor

A helper class for facilitating preprocessing of text corpus before any topic modeling algorithms

1K 2 0
MycroftAI
padatious

A neural network intent parser

1K 162 42
nickduran
align

Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpora.

1K 54 17
direct-phonology
dphon

Tools and algorithms for phonology-aware Early Chinese NLP.

1K 16 1
seandstewart
iambic

Data extraction and rendering library for Shakespearean text.

884 1 0
neplex
architxt

ArchiTXT is a tool for structuring textual data into a valid database model. It is guided by a meta-grammar and uses an iterative process of tree rewriting.

884 5 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery