PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Byte Pair Encoding Python Packages

Python packages with the GitHub topic byte-pair-encoding. Sorted by relevance, with stars and monthly downloads.
gweidart
rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

17K 38 5
crodriguez1a
bpe-summarizer

This summarizer attempts to leverage Byte Pair Encoding (BPE) tokenization and the Bart vocabulary to filter text by semantic meaningfulness.

470 3 1
akhvorov
vgram

Feature extraction from sequential data

316 7 0
DVDAGames
pgn-tokenizer

PGN Tokenizer, a Byte Pair Encoding (BPE) tokenizer for Chess Portable Game Notiation (PGN).

272 0 0
jiauzhang
textok

Text Tokenizer in C++

81 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery