PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Streaming Data Python Packages

Python packages with the GitHub topic streaming-data. Sorted by relevance, with stars and monthly downloads.
piskvorky
smart-open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

69.6M 3K 387
online-ml
river

🌊 Online machine learning in Python

240K 6K 626
guillermo-navas-palencia
optbinning

Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.

215K 523 116
quixio
quixstreams

Python Streaming DataFrames for Kafka

87K 2K 106
Sinotrade
shioaji

Shioaji all new cross platform api for trading ( 跨平台證券交易API )

70K 436 32
python-streamz
streamz

Real-time stream processing for python

50K 1K 149
bytewax
bytewax

Python Stream Processing

42K 2K 109
aramisfacchinetti
streaming-json-parser

Streaming JSON parser designed to process JSON data incrementally. The primary goal is to handle potentially incomplete JSON data streams, such as those produced by Large Language Models (LLMs), and return the current state of the parsed object at any time.

9K 13 2
MaterializeInc
dbt-materialize

The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL

6K 6K 504
scikit-multiflow
scikit-multiflow

A machine learning package for streaming data in Python. The other ancestor of River.

4K 794 189
creme-ml
creme

🌊 Online machine learning in Python

4K 6K 626
denisecase
datafun-streaming

Shared Python utilities for Kafka, DuckDB, validation, stats, and visualization across streaming data analytics projects.

3K 1 0
readysettech
rdst

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.

3K 5K 159
maki-nage
rxsci

ReactiveX for data science

2K 14 2
sdpython
pandas-streaming

Streaming API for pandas applied to big datasets

2K 31 9
sidkris
streamframe

streamframe is a lightweight engine for computing real-time features over streaming data, with constant-time updates and queries. Written in Rust and available as a Python library.

1K 0 0
selimfirat
pysad

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

1K 287 27
quantfinlib
screamer

Screamingly fast streaming indicators with C++ performance and Python simplicity.

990 4 1
thammo4
uvatradier

wahoowa

975 29 19
streamdal
streamdal-protos

Code-Native Data Privacy

718 615 16
Menziess
slipstream-async

Slipstream provides a data-flow model to simplify development of stateful streaming applications.

638 39 2
Jgprog117
typedkafka

A well-documented, fully type-hinted Kafka client for Python

350 5 0
neurodata
sdtf

Exploring streaming options for decision trees and random forests. Based on scikit-learn fork.

287 9 3
marrow
cinje

A Pythonic and ultra fast template engine DSL.

286 35 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery