PyRank

Tokenizers Python Packages

Python packages tagged with the GitHub topic "tokenizers", sorted by relevance and shown with their stars and monthly downloads.
megagonlabs
ginza-transformers

Use custom tokenizers in spacy-transformers

31K 16 5
gweidart
rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

17K 38 5
Prismadic
llm-magnet

the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly

1K 32 4
1kkiRen
tokenizerchanger

Library for manipulating the existing tokenizer.

715 21 1
bimri
precious-nlp

A tokenizer-free NLP library with T-FREE, CANINE, and byte-level approaches

646 0 0
Hugging-Face-Supporter
tftokenizers

Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels.

438 10 4
Data from PyPI, GitHub, ClickHouse, and BigQuery