PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Processing Pipelines Python Packages

Python packages with the GitHub topic data-processing-pipelines. Sorted by relevance, with stars and monthly downloads.
westandskif
convtools

convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation

7K 40 11
edrewitz
wxdata

A Python package of end-to-end weather data clients & raw data clients with VPN/PROXY support, data processors that decode variable keys from GRIB format into a plain-language format & various tools for assisting Python automated workflows, querying meteorological datasets and filling gaps in meteorological data.

2K 24 1
Plato-solutions
artifician

Artifician is an event driven framework developed to simplify the process of preparation of the dataset for Artificial Intelligence models.

1K 10 0
kaburia
filter-stations

A secure, unified Python interface for African climate data, integrating TAHMO station data and gridded datasets (IMERG, CHIRPS, ERA5, TAMSAT), and medium-to-seasonal weather models

1K 17 4
tamasgal
thepipe

A lightweight, general purpose pipeline framework.

1K 14 2
graphbookai
graphbook

Visual AI development framework for training and inference of ML models, scaling pipelines, and automating workflows with Python

998 49 7
graphbookai
graphbook-huggingface

Visual AI development framework for training and inference of ML models, scaling pipelines, and automating workflows with Python

753 49 7
NVIDIA
invisible-rabbit

Scalable data pre processing and curation toolkit for LLMs

176 2K 267
NVIDIA
invisible-unicorn

Scalable data pre processing and curation toolkit for LLMs

85 2K 267
NVIDIA
lava-ray

Scalable data pre processing and curation toolkit for LLMs

1 2K 267
    • Data from PyPI, GitHub, ClickHouse, and BigQuery