PyRank

Data Governance Python Packages

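Python packages with the GitHub topic data-governance, sorted by relevance. Each entry lists the GitHub owner, the package name, the project description, and three counts: monthly downloads first, then repository-level GitHub figures (stars, followed by what appears to be forks).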
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.3M 12K 3K
reata
sqllineage

SQL Lineage Analysis Tool powered by Python

1.6M 2K 276
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.3M 2K 217
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

1.2M 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

426K 14K 2K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

170K 12K 3K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

168K 12K 3K
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

62K 12K 3K
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

28K 14K 2K
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

20K 12K 3K
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

20K 12K 3K
OpenDQV
opendqv

OpenDQV Core — open-source, contract-driven data quality validation engine for data pipelines and API boundaries

13K 10 2
flyersworder
agentic-data-contracts

YAML-first, domain-driven data governance for AI agents — teach agents your business domains, metrics, and rules before they write SQL

7K 8 0
Titan-Systems
titan-core

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.

2K 481 39
data-drift
driftdb

Metrics Observability & Troubleshooting

2K 331 12
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

2K 14K 2K
open-metadata
openmetadata-airflow-managed-apis

Airflow REST APIs to create and manage DAGS

1K 14K 2K
MetricProvenance
odgs

Open Data Governance Standard — Sovereign Validation Engine

1K 0 0
datachecks
dcs-core

Open Source Data Quality Monitoring.

1K 171 23
data-drift
datagit

Metrics Observability & Troubleshooting

1K 331 12
Obsidian-Owl
floe-core

The Open Platform for building Data Platforms. Ship faster. Stay compliant. Scale to Data Mesh.

924 3 11
tokern
data-lineage

Generate and Visualize Data Lineage from query history

768 327 45
datahub-project
acryl-datahub-airflow-plugin-hcc-patched

The Context Platform for your Data and AI Stack

750 12K 3K
mesmacosta
datacatalog-util

A Python package to centralize some Google Cloud Data Catalog scripts. This repo contains commands like bulk CSV operations that help leverage Data Catalog features.

713 20 7
Data from PyPI, GitHub, ClickHouse, and BigQuery