PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Lakehouse Python Packages

Python packages with the GitHub topic data-lakehouse. Sorted by relevance, with stars and monthly downloads.
laminlabs
lamindb

Open-source data lakehouse for biology. Context and memory for datasets and models at scale, across infrastructure. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

95K 264 25
laminlabs
lamindb-core

Open-source data lakehouse for biology. Context and memory for datasets and models at scale, across infrastructure. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. 🍊YC S22

22K 264 25
sdebruyn
dbt-fabric-samdebruyn

Maintained and extended fork combining dbt-fabric and dbt-fabricspark

7K 9 2
PFund-Software-Ltd
pfeed

Data pipeline for algo-trading, getting and storing both real-time and historical data made easy.

416 32 7
realdatadriven
etlx-wrapper

Python wrapper for ETLX CLI to run ETL workflows from Python

244 43 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery