PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Distributed Training Python Packages

Python packages with the GitHub topic distributed-training. Sorted by relevance, with stars and monthly downloads.
huggingface
timm

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

13.4M 37K 5K
skypilot-org
skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

1.8M 10K 1K
paddlepaddle
paddlepaddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

1.8M 24K 6K
Netflix
metaflow

Build, Manage and Deploy AI/ML Systems

716K 10K 1K
skypilot-org
skypilot-nightly

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

407K 10K 1K
pytorch
torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

346K 423 152
meta-pytorch
torchx-nightly

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

97K 423 152
tensorcircuit
tensorcircuit-nightly

The next-gen AI-native tensor-network-based quantum software framework

73K 75 19
Netflix
metaflow-stubs

Build, Manage and Deploy AI/ML Systems

69K 10K 1K
paddlepaddle
paddlepaddle-gpu

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

67K 24K 6K
PaddlePaddle
paddlenlp

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

32K 13K 3K
skypilot-org
trainy-skypilot-nightly

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

17K 10K 1K
PaddlePaddle
tool-helpers

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

10K 13K 3K
PaddlePaddle
fast-dataindex

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

10K 13K 3K
learning-at-home
hivemind

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

3K 2K 227
FedML-AI
fedml

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

3K 4K 766
PanJinquan
basetrainer

Pytorch分布式训练框架

2K 85 11
pytorch
codeflare-torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

2K 423 152
eduardoslonski
telescope-ui

Real-time observability dashboard for the Telescope RL post-training framework. Monitor metrics, rollouts, traces, GPU infrastructure, and evals at scale.

1K 3 0
petuum
adaptdl

Dynamic-resource trainer and scheduler for deep learning

1K 459 81
petuum
adaptdl-sched

Dynamic-resource trainer and scheduler for deep learning

1K 459 81
PaddlePaddle
fast-tokenizer-python

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

1K 13K 3K
4paradigm
openembedding

OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.

977 33 6
intelligent-machine-learning
atorch

DLRover: An Automatic Distributed Deep Learning System

955 2K 213
    • Data from PyPI, GitHub, ClickHouse, and BigQuery