PyRank

Sentence Segmentation Python Packages

Python packages tagged with the GitHub topic sentence-segmentation, sorted by relevance and shown with stars and monthly downloads.
wikimedia
sentencex

A sentence segmentation library with wide language support optimized for speed and utility.

172K 126 15
natasha
razdel

Rule-based token and sentence segmentation for the Russian language

102K 281 34
natasha
natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

49K 1K 116
segment-any-text
wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

28K 1K 83
superlinear-ai
wtpsplit-lite

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

19K 39 4
StarlangSoftware
nlptoolkit-corpus

Corpus processing library

3K 3 9
craigtrim
fast-sentence-segment

Fast and Efficient Sentence Segmentation

3K 3 0
zaemyung
sentsplit

A flexible sentence segmentation library using CRF model and regex rules

2K 31 10
nlp-uoregon
trankit

Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing

2K 795 107
StarlangSoftware
nlptoolkit-corpus-cy

Corpus Processing Library

2K 0 0
hellonlp
hellonlp

NLP tools: word segmentation, sentence segmentation, new-word discovery

1K 27 9
mkartawijaya
hasami

A tool to perform sentence segmentation on Japanese text

1K 6 0
veldica
prose-tokenizer

High-precision prose and Markdown tokenization for natural language processing.

927 1 0
bureaucratic-labs
b-labs-models

Pre-trained models for tokenization, sentence segmentation and so on

661 15 5
mawo-ru
mawo-razdel

Advanced tokenization for the Russian language with SynTagRus patterns

429 11 0
tc64
spacyss

Sentence segmenters for spaCy 2.0+

312 9 1
Okramjimmy
meitei-senter

Neural sentence boundary detection for Meitei Mayek (Manipuri) using SentencePiece tokenization and a CNN-based spaCy pipeline.

226 0 0
seanghay
khmerpunctuate

Punctuation Restoration for Khmer language

183 5 1
mkartawijaya
py-hasami

A tool to perform sentence segmentation on Japanese text

78 6 0
eaklykova
syntaxcomp

A Python3 package for extracting syntactic complexity measures from CoNLL-U annotations.

64 4 2
segment-any-text
wtpsplit-triton

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

63 1K 83
Data from PyPI, GitHub, ClickHouse, and BigQuery