PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Datafusion Python Packages

Python packages with the GitHub topic datafusion. Sorted by relevance, with stars and monthly downloads.
ibis-project
ibis-framework

the portable Python dataframe library

1.9M 7K 722
lakehq
pysail

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

29K 3K 146
AndreaBozzo
dataprof

Library and CLI for profiling tabular data

15K 14 1
ryan-evans-git
ematix-flow

Move data between databases, files, and streams from Python. 5.87× faster than PySpark.

10K 0 0
strake-data
strake

The Data Layer for AI. A high-performance federated SQL engine that gives AI agents governed, zero-copy access to your entire data stack (Postgres, S3, APIs).

5K 3 1
roapi
roapi

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

1K 3K 211
madesroches
micromegas

Python analytics client for the Micromegas observability platform

1K 43 6
roapi
roapi-http

No description available

979 3K 211
roapi
columnq-cli

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

936 3K 211
mag1cfrog
timeseries-table-format

Rust-native time-series table format with gap/overlap tracking and SQL queries

750 15 1
jychen7
bigtableql

Query Layer for Google Cloud Bigtable

536 1 0
ibis-project
turntable-spoonbill

the portable Python dataframe library

305 7K 722
lostmygithubaccount
ibis-bench

A composable data system benchmark in a Python package.

292 1 1
georgeleepatterson
clickarrow

ClickHouse Native Protocol Rust Client w/ Arrow Compatibility

231 48 12
jychen7
bigql

Query Layer for Google Cloud Bigtable

221 1 0
lakesoul-io
lakesoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

147 3K 415
apache
datafusion-cli

Apache DataFusion SQL Query Engine

2 9K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery