PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Discovery Python Packages

Python packages with the GitHub topic data-discovery. Sorted by relevance, with stars and monthly downloads.
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.3M 12K 3K
reata
sqllineage

SQL Lineage Analysis Tool powered by Python

1.6M 2K 276
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

1.2M 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

426K 14K 2K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

170K 12K 3K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

168K 12K 3K
amundsen-io
amundsen-common

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

129K 5K 966
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

62K 12K 3K
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

28K 14K 2K
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

20K 12K 3K
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

20K 12K 3K
recap-cloud
recap-core

Work with your web service, database, and streaming schemas in a single format.

9K 349 26
ywatanabe1989
scitex-dataset

Multi-domain scientific dataset fetcher — neuroscience, biology, pharmacology, medical. Part of SciTeX.

6K 0 0
amundsen-io
amundsen-search

Search Service for Amundsen

3K 5K 966
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

2K 14K 2K
open-metadata
openmetadata-airflow-managed-apis

Airflow REST APIs to create and manage DAGS

1K 14K 2K
opendatadiscovery
odd-collector-sdk

ODD Collector

1K 4 0
carte-data
carte-cli

A static site generator for data catalogs

1K 29 0
Intugle
intugle

The GenAI-powered toolkit for automated data intelligence.

1K 149 43
related-sciences
articat

articat: data artifact catalog

915 17 2
datahub-project
acryl-datahub-airflow-plugin-hcc-patched

The Context Platform for your Data and AI Stack

750 12K 3K
datahub-project
acryl-datahub-airflow-plugin-patched

The Context Platform for your Data and AI Stack

702 12K 3K
Protegrity-Developer-Edition
protegrity-developer-python

Python module for integrating Protegrity's Data Discovery and Protection APIs into GenAI and traditional applications.

680 9 1
datahub-project
acryl-datahub-tc

The Context Platform for your Data and AI Stack

623 12K 3K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery