Pretraining Python Packages

rose-opt

🌹 Rose: Range-Of-Slice Equilibration PyTorch optimizer. Stateless optimization through range-normalized gradient updates.

4K 74 5

ptlasso

A Python package for fitting pretrained Lasso models

2K 1 0

gpt-simple-lm

A clean, readable framework for pretraining language models from scratch.

1K 0 0

forgellm

A comprehensive toolkit for end-to-end continued pre-training, fine-tuning, monitoring, testing and publishing of language models with MLX-LM

1K 4 0

decontaminate

`decon`, but with python API binding.

995 3 0

iltm

iLTM: Integrated Large Tabular Model

845 22 0

alea-preprocess

Accessible, efficient data preprocessing library for pretrain and SFT datasets, including KL3M

733 1 0

ccdown

A rust based, resumable downloader cli and python library for Common Crawl data

559 0 0

zeldarose

Train transformer-based models.

423 28 3

proteinworkshop

Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)

376 274 22

autoevolve

Companion tools for Karpathy's autoresearch - smarter evaluation, guided steering, and multi-agent competitions for GPT pretraining

366 6 1

fleet-x

飞桨大模型开发套件，提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

353 481 165

chinchilla

A toolkit for scaling law research ⚖

341 68 5

autojudge

Companion tools for Karpathy's autoresearch - smarter evaluation, guided steering, and multi-agent competitions for GPT pretraining

285 6 1

autosteer

Companion tools for Karpathy's autoresearch - smarter evaluation, guided steering, and multi-agent competitions for GPT pretraining

276 6 1

graphg

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

222 1K 80

lumenspark

Lumenspark is a lightweight Linformer-based Language Model Trained from Scratch

191 1 0

fleet-lightning

No description available

189 481 165

ngab

Benchmarking and generating PE for GNNs via the Graph Alignment task. Code for our paper: Graph Alignment for Benchmarking Graph Neural Networks and Learning Positional Encodings

157 2 0

sotastream

A library for data streaming and augmentation

114 22 4

paddle-fleet

飞桨大模型开发套件，提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

100 481 165