PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Low Resource Languages Python Packages

Python packages with the GitHub topic low-resource-languages. Sorted by relevance, with stars and monthly downloads.
EveryVoiceTTS
everyvoice

The EveryVoice TTS Toolkit - Text To Speech for your language

2K 44 4
ljvmiranda921
calamancy

NLP pipelines for Tagalog using spaCy

1K 70 6
sefineh-ai
amharic-tokenizer

Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.

473 99 14
BodduSriPavan-111
chandassu

Chandassu: First Python Library for Global Metrical Poetry

361 14 0
Okramjimmy
meitei-senter

Neural sentence boundary detection for Meitei Mayek (Manipuri) using SentencePiece tokenization and a CNN-based spaCy pipeline.

213 0 0
Jubeerathan
annaparavai

This repository contains the implementation of a transfer learning-based approach for detecting AI-generated product reviews in Tamil and Malayalam. It includes pretrained model embeddings, deep neural networks, and an ensemble method to enhance classification accuracy.

200 0 1
posi-olomo
padie-extended

The first open-source Nigerian language text classifier on PyPI.

196 28 4
alexeyev
tratreetra

Syntactic transfer from more resourced languages: TRAnslating TREEbanks for Syntactic TRAnsfer.

65 0 0
Andrews2017
kkltk

The Kinyarwanda and Kirundi Languages Toolkit (KKLTK) is a Python package for Kinyarwanda and Kirundi languages processing. KKLTK currently provides the sets of stopwords for both languages and other preprocessing tools such as Kinyarwanda and Kirundi tokenizers will be added soon. KKLTK requires Python 3.0, 3.5, 3.6, 3.7, or 3.8.

37 1 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery