PyRank

Unicode Python Packages

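Python packages tagged with the GitHub topic "unicode", sorted by relevance. Each entry gives the GitHub owner, the package name, a short description, and a stats line that starts with monthly downloads followed by GitHub stars.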
kjd
idna

Internationalized Domain Names for Python (IDNA 2008 and UTS #46)

1.6B 278 120
jawah
charset-normalizer

Truly universal encoding detector in pure Python.

1.4B 770 64
phfaist
pylatexenc

Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion

4.9M 414 51
ashvardanian
stringzilla

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

4.9M 3K 125
anyascii
anyascii

Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET

4.2M 387 29
mlodewijck
pyunormalize

Unicode normalization forms (NFC, NFD, NFKC, NFKD). A pure-Python implementation independent of Python’s core Unicode database, supporting version 17.0 of the Unicode Standard.

4.1M 9 2
vhf
confusable-homoglyphs

ϲοnfuѕаblе_һοmоɡlyphs

1.2M 164 19
aio-libs
idna-ssl

Patch ssl.match_hostname to support Unicode (IDNA) domains

859K 9 8
dylan-profiler
tangled-up-in-unicode

Access to the Unicode Character Database (UCD)

383K 3 6
bsolomon1124
demoji

Accurately find/replace/remove emojis in text strings

283K 163 21
jtauber
pyuca

A Python implementation of the Unicode Collation Algorithm

256K 225 24
tammoippen
plotille

Plot in the terminal using braille dots.

231K 520 22
pudo
normality

A tiny library for Python text normalisation. Useful for ad-hoc text processing.

227K 157 18
pycontribs
tendo

Official repository of the Python tendo library, always welcoming new contributions.

226K 147 47
mkalinski
morphys

Smart conversions between unicode and bytes types for common cases in Python.

210K 1 2
mjpieters
rtfunicode

Encoder for unicode to RTF 1.5 command sequences

93K 18 4
savioxavier
pyboxen

Incredibly customizable terminal boxes for Python

66K 45 0
python-formate
flake8-encodings

A Flake8 plugin to identify incorrect use of encodings.

48K 7 2
DenverCoder1
table2ascii

An intuitive and type-safe Python library for converting lists to fancy ASCII tables for displaying in the terminal or code-blocks

48K 75 17
olavolav
uniplot

Lightweight plotting to the terminal. 4x resolution via Unicode.

44K 451 23
OpenNMT
pyonmttok

Fast and customizable text tokenization library with BPE and SentencePiece support

44K 333 83
svenkreiss
unicodeit

Converts LaTeX tags to unicode: \mathcal{H} → ℋ. Available on the web or as an Automator script for the Mac.

38K 349 39
life4
homoglyphs

Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.

25K 84 23
Project-Navi
navi-sanitize

Deterministic input sanitization for untrusted text — invisible characters, homoglyphs, and encoding tricks, handled before your code sees them. Zero dependencies, no ML. Python 3.12+.

17K 2 0
Data from PyPI, GitHub, ClickHouse, and BigQuery