PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Catalog Python Packages

Python packages with the GitHub topic data-catalog. Sorted by relevance, with stars and monthly downloads.
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.3M 12K 3K
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

1.2M 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

426K 14K 2K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

170K 12K 3K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

168K 12K 3K
amundsen-io
amundsen-common

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

129K 5K 966
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

62K 12K 3K
intake
intake-esm

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

31K 160 53
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

28K 14K 2K
docglow
docglow

Modern documentation site generator for dbt Core — lineage explorer, health scoring, full-text search. Live demo: https://demo.docglow.com

27K 90 3
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

20K 12K 3K
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

20K 12K 3K
recap-cloud
recap-core

Work with your web service, database, and streaming schemas in a single format.

9K 349 26
omeryasirkucuk
amx-cli

AI-driven CLI for documenting database schemas. DB + docs + codebase agents, 10 backends, BYO LLM, human-in-the-loop review.

6K 16 0
apache
apache-gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

3K 3K 835
amundsen-io
amundsen-search

Search Service for Amundsen

3K 5K 966
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

2K 14K 2K
open-metadata
openmetadata-airflow-managed-apis

Airflow REST APIs to create and manage DAGS

1K 14K 2K
gauthierpiarrette
dbt-features

Feature catalog for dbt projects, built for ML teams.

1K 0 0
tokern
piicatcher

Find PII data in databases

1K 341 98
carte-data
carte-cli

A static site generator for data catalogs

1K 29 0
Intugle
intugle

The GenAI-powered toolkit for automated data intelligence.

1K 149 43
related-sciences
articat

articat: data artifact catalog

915 17 2
datahub-project
acryl-datahub-airflow-plugin-hcc-patched

The Context Platform for your Data and AI Stack

750 12K 3K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery