PyRank

Distributed Systems Python Packages

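Python packages tagged with the GitHub topic distributed-systems, sorted by relevance. Each entry lists the GitHub owner, the package name, a short description, and a line of figures: monthly downloads, GitHub stars, and a third count that appears to be forks.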
py4j
py4j

Py4J enables Python programs to dynamically access arbitrary Java objects

111.6M 1K 235
dmlc
xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

43.1M 28K 9K
fugue-project
fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

2.6M 2K 100
akhundMurad
typeid-python

Python implementation of TypeIDs: type-safe, K-sortable, and globally unique identifiers inspired by Stripe IDs

1.1M 150 17
ag2ai
faststream

FastStream is a powerful and easy-to-use Python framework for building asynchronous services that interact with event streams such as Apache Kafka, RabbitMQ, NATS, MQTT and Redis.

1M 5K 348
fugue-project
fugue-sql-antlr

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

989K 2K 100
Eventual-Inc
daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

810K 5K 474
faust-streaming
faust-streaming

Python Stream Processing. A Faust fork

485K 2K 203
dmlc
xgboost-cpu

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

484K 28K 9K
irmen
pyro5

Pyro 5 - Python remote objects

298K 383 46
pytorch
torchft-nightly

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

218K 501 64
kalepa
safe-init

Safe Init is a Python library that enhances AWS Lambda functions with advanced error handling, logging, monitoring, and resilience features, providing comprehensive observability and reliability for serverless applications.

99K 6 0
rucio
rucio-clients

Rucio - Scientific Data Management

91K 298 383
pyeventsourcing
eventsourcing

A library for event sourcing in Python.

75K 2K 143
py-sherlock
sherlock

Easy distributed locks for Python with a choice of backends.

41K 378 34
v6d-io
vineyard

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

33K 951 133
pegasus-isi
pegasus-wms-api

Pegasus Workflow Management System - Automate, recover, and debug scientific computations.

29K 224 90
pegasus-isi
pegasus-wms-common

Pegasus Workflow Management System - Automate, recover, and debug scientific computations.

29K 224 90
tastyware
streaq

Fast, async, fully-typed distributed task queue via Redis streams

27K 144 11
bakwc
pysyncobj

A library for replicating your Python class between multiple servers, based on the Raft protocol

23K 751 118
Attumm
redis-dict

Python dictionary with Redis as backend, built for large datasets. Simplifies Redis operations for large-scale and distributed systems. Supports various data types, namespacing, pipelining, and expiration.

22K 76 13
rustakka
atomr-infer

Multi-runtime GPU + remote inference as a supervised actor system on atomr. OpenAI / Anthropic / Gemini / LiteLLM remote runtimes + vLLM / TensorRT / ORT / mistral.rs local; remote-only build compiles zero GPU deps.

22K 0 0
google
google-vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

21K 2K 110
v6d-io
vineyard-bdist

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

21K 951 133
Data from PyPI, GitHub, ClickHouse, and BigQuery