PyRank

Data Warehouse Python Packages

Python packages with the GitHub topic data-warehouse. Sorted by relevance, with stars and monthly downloads.
dlt-hub
dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

5.7M 5K 508
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.3M 2K 217
PostHog
hogql-parser

🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

1.1M 35K 3K
vmware
quickstart-vdk

One framework to develop, deploy and operate data workflows with Python and SQL.

10K 481 67
sdebruyn
dbt-fabric-samdebruyn

Maintained and extended fork combining dbt-fabric and dbt-fabricspark

7K 9 2
crate
dlt-cratedb

dlt destination adapter for CrateDB

7K 0 0
drt-hub
drt-core

Reverse ETL for the code-first data stack

4K 25 37
unytics
bigfunctions

Supercharge BigQuery with BigFunctions

4K 757 70
unytics
airbyte-serverless

Airbyte made simple (no UI, no database, no cluster)

3K 196 17
vmware
vdk-core

Versatile Data Kit SDK Core

3K 481 67
iiasa
ixmp

The ix modeling platform for integrated and cross-cutting scenario analysis

3K 39 114
Titan-Systems
titan-core

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.

2K 481 39
beneath-hq
beneath

Beneath is a serverless real-time data platform ⚡️

2K 84 10
GClunies
reflekt

Define, govern, and model event data for warehouse-first product analytics.

2K 86 4
vmware
vdk-jupyterlab-extension

One framework to develop, deploy and operate data workflows with Python and SQL.

2K 481 67
ottogroup
koality

Library for data quality monitoring based on duckdb.

1K 4 1
firebolt-db
dbt-firebolt

The dbt adapter for Firebolt

1K 30 11
vmware
vdk-control-cli

One framework to develop, deploy and operate data workflows with Python and SQL.

937 481 67
dlt-hub
dlt-core

dlt is an open-source, Python-first, scalable data loading library that does not require any backend to run.

927 5K 508
vmware
vdk-lineage-model

VDK Lineage Model plugin defines a common lineage model and classes used for managing lineage information in other VDK plugins.

879 481 67
google
space-datasets

Unified storage framework for the entire machine learning lifecycle

747 155 8
vmware
vdk-trino

One framework to develop, deploy and operate data workflows with Python and SQL.

727 481 67
elementary-data
elementary-lineage

elementary-lineage is deprecated and moved to elementary-data

684 2K 217
vmware
vdk-impala

One framework to develop, deploy and operate data workflows with Python and SQL.

650 481 67
Data from PyPI, GitHub, ClickHouse, and BigQuery