PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Text Segmentation Python Packages

Python packages with the GitHub topic text-segmentation. Sorted by relevance, with stars and monthly downloads.
mammothb
symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

411K 870 126
catalyst-team
catalyst

Accelerated deep learning R&D

22K 3K 400
craigtrim
fast-sentence-segment

Fast and Efficient Sentence Segmentation

3K 3 0
cbaziotis
ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

2K 675 94
blmoistawinde
harvesttext

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

2K 3K 339
bedapudi6788
deepsegment

Sentence Segmentation with sequece tagging

1K 304 55
TheWelcomer
morphseg

A multilingual package for segmenting text into morphemes using supervised deep learning.

1K 2 0
viig99
symspellcpppy

Fast SymSpell written in c++ and exposes to python via pybind11

773 44 9
sobir-git
tajik-text-segmentation

Tajik text segmentation algorithms

549 1 0
retkowski
chunkseg

Evaluate chaptering quality for audio and video content in time space, supporting segmentation and title generation

545 3 0
craigtrim
lingpatlab

LingPatLab: Linguistic Pattern Laboratory

362 2 0
rlayers
pawpaw

High Performance Text Processing & Segmentation Framework

282 28 4
cspnms
mschunker

Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).

200 1 0
catalyst-team
catalyst-pdm

Accelerated deep learning R&D

113 3K 400
TheWelcomer
testmorphseg

An efficient and easy-to-use morpheme segmentation library

1 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery