PyRank

Serving Python Packages

Python packages tagged with the GitHub topic "serving", sorted by relevance. Each entry lists its stars and monthly downloads.
ray-project
ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

56.1M 43K 8K
pytorch
torch-model-archiver

Serve, optimize and scale PyTorch models in production

963K 4K 886
Lightning-AI
litserve

A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.

55K 4K 285
pytorch
torchserve

Serve, optimize and scale PyTorch models in production

54K 4K 886
ray-project
ant-ray-cpp-nightly

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

43K 43K 8K
ray-project
ray-cpp

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

42K 43K 8K
pytorch
torch-model-archiver-nightly

Serve, optimize and scale PyTorch models in production

17K 4K 886
SeldonIO
seldon-core

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

13K 5K 866
openvinotoolkit
ovmsclient

A scalable inference server for models optimized with OpenVINO™

10K 875 253
clearml
clearml-serving

ClearML - Model-Serving Orchestration and Repository Solution

10K 164 50
pytorch
torch-workflow-archiver

Serve, optimize and scale PyTorch models in production

8K 4K 886
ray-project
ant-ray-nightly

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

7K 43K 8K
pytorch
torch-workflow-archiver-nightly

Serve, optimize and scale PyTorch models in production

7K 4K 886
pytorch
torchserve-nightly

Serve, optimize and scale PyTorch models in production

6K 4K 886
torchpipe
omniback

Serving Inside PyTorch

5K 170 13
polyaxon
haupt

Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon

5K 452 207
friendliai
friendli-client

Friendli Suite Client

4K 50 7
secretflow
secretflow-serving-lib

SecretFlow-Serving is a serving system for privacy-preserving machine learning models.

4K 16 6
ray-project
ant-ray

Ray provides a simple, universal API for building distributed applications.

4K 43K 8K
notAI-tech
fastdeploy

Deploy DL/ML inference pipelines with minimal extra code.

4K 104 17
PaddlePaddle
paddle-serving-server

A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)

3K 919 246
bodywork-ml
bodywork

ML pipeline orchestration and model deployments on Kubernetes, made really easy.

2K 436 23
PaddlePaddle
paddle-serving-client

A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)

2K 919 246
PaddlePaddle
fastdeploy-python

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

2K 4K 744
    • Data from PyPI, GitHub, ClickHouse, and BigQuery