PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Learned Tokenization Python Packages

Python packages with the GitHub topic learned-tokenization. Sorted by relevance, with stars and monthly downloads.
lucidrains
megabyte-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

6K 655 55
lucidrains
rvq-vae-gpt

My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation

568 90 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery