PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Pre Training Python Packages

Python packages with the GitHub topic pre-training. Sorted by relevance, with stars and monthly downloads.
helicalAI
helical

A framework for state-of-the-art pre-trained bio foundation models on genomics and transcriptomics modalities.

6K 214 37
datajuicer
py-data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

5K 6K 371
lucidrains
marge-pytorch

Implementation of Marge, Pre-training via Paraphrasing, in Pytorch

940 76 11
lucidrains
electra-pytorch

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch

827 236 46
lucidrains
coco-lm-pytorch

COCO - Pytorch

746 46 7
lucidrains
mlm-pytorch

An implementation of masked language modeling for Pytorch, made as concise and simple as possible

596 181 24
NVlabs
ps3-torch

Scaling Vision Pre-Training to 4K Resolution

540 227 10
4thel00z
ccdown

A rust based, resumable downloader cli and python library for Common Crawl data

479 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery