PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Iceberg Python Packages

Python packages with the GitHub topic iceberg. Sorted by relevance, with stars and monthly downloads.
apache
pyiceberg

PyIceberg

43M 1K 494
Eventual-Inc
daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

810K 5K 474
starrocks
starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

554K 12K 2K
apache
pydoris-custom

Apache Doris is an easy-to-use, high performance and unified analytics database.

228K 15K 4K
apache
pydoris

Apache Doris is an easy-to-use, high performance and unified analytics database.

92K 15K 4K
lakehq
pysail

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

29K 3K 146
projectnessie
pynessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

25K 1K 174
mabel-dev
opteryx

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

21K 112 14
mabel-dev
opteryx-core

🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.

6K 112 14
Eventual-Inc
daft-lts

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

6K 5K 474
jghoman
pyducklake

A Python library for Ducklake, providing a pyiceberg-like API

3K 4 1
arrowjet
arrowjet

The fastest way to move data in and out of database.

3K 1 1
apache
dbt-doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

3K 15K 4K
apache
redpanda-polaris-catalog-python

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

3K 2K 444
sidequery
dlt-iceberg

An Iceberg destination for DLT that supports REST catalogs

3K 10 5
legout
duckalog

Build DuckDB catalogs from declarative YAML/JSON configuration files

2K 1 0
Eventual-Inc
daft-qdrant

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

1K 5K 474
rodmena-limited
datashard

Iceberg robustness, for the rest of us | S3 and Local safe file operations + Pandas support to query your data and logs.

999 4 0
datacoolie
datacoolie

Metadata-driven ETL framework for portable data pipelines across Polars, Spark, Fabric, Databricks, and AWS.

947 8 0
Obsidian-Owl
floe-core

The Open Platform for building Data Platforms. Ship faster. Stay compliant. Scale to Data Mesh.

924 3 11
apache
apache-polaris

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

668 2K 444
srpraneeth
torch-dataloader-utils

Efficient Data Loader Utils for loading data from structured sources into Pytorch

638 0 0
Obsidian-Owl
floe-iceberg

The Open Platform for building Data Platforms. Ship faster. Stay compliant. Scale to Data Mesh.

534 3 11
slidoapp
duckberg

No description available

498 75 5
    • Data from PyPI, GitHub, ClickHouse, and BigQuery