PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Dataengineering Python Packages

Python packages with the GitHub topic dataengineering. Sorted by relevance, with stars and monthly downloads.
datafold
collate-data-diff

Compare tables within or across databases

974K 3K 305
SQLMesh
sqlmesh

Scalable and efficient data transformation framework - backwards compatible with dbt.

518K 3K 383
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

426K 14K 2K
datafold
data-diff

Compare tables within or across databases

49K 3K 305
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

28K 14K 2K
zinggAI
zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

5K 1K 168
awslabs
aws-ddk-core

An open source development framework to help you build data workflows and modern data architecture on AWS.

3K 271 24
grai-io
grai-schemas

No description available

2K 314 20
awslabs
aws-orbit-overprovisioning

Launch a Pod for the team space that executes a script given by the user

2K 147 92
grai-io
grai-client

No description available

2K 314 20
afogarty85
camelcasing

Official repository for the camelCasing package

2K 2 0
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

2K 14K 2K
open-metadata
openmetadata-airflow-managed-apis

Airflow REST APIs to create and manage DAGS

1K 14K 2K
prodmodel
prodmodel

Build data science pipelines and models

1K 58 3
grai-io
grai-source-dbt

No description available

1K 314 20
awslabs
aws-orbit

Data & ML Unified Development and Production Environment.

982 147 92
awslabs
aws-orbit-team-script-launcher

Launch a Pod for the team space that executes a script given by the user

968 147 92
awslabs
aws-orbit-code-commit

Orbit Workbench CodeCommit Plugin.

934 147 92
awslabs
aws-orbit-custom-cfn

Launch a CloudFormation stack for the team space

918 147 92
awslabs
aws-orbit-sdk

AWS Orbit Workbench SDK

902 147 92
awslabs
aws-orbit-redshift

Orbit Workbench Redshift Plugin.

902 147 92
awslabs
aws-orbit-hello-world

Minimal Orbit Workbench Plugin.

898 147 92
awslabs
aws-orbit-emr-on-eks

Allow users to run EMR jobs on their EKS namespace

847 147 92
grai-io
grai-source-postgres

No description available

768 314 20
    • Data from PyPI, GitHub, ClickHouse, and BigQuery