PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Ingestion Python Packages

Python packages with the GitHub topic data-ingestion. Sorted by relevance, with stars and monthly downloads.
Dynatrace
oneagent-sdk

Enables custom tracing of Python applications in Dynatrace

188K 27 10
bruin-data
ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

51K 3K 119
bruin-data
bruin-sdk

Bruin Python SDK — eliminate boilerplate in Bruin Python assets

7K 6 0
startreedata
druid-to-pinot-migrator

Migrate Apache Druid ingestion specs to Apache Pinot artifacts.

5K 1 0
tracebloc
tracebloc-ingestor

tracebloc data pipeline for training/test dataset setup

1K 8 0
MatheusGiacomo
dataforge-dfg

Data Forge is a high-performance, CLI-first data integration tool designed to streamline the lifecycle of data from ingestion to transformation. Built with Python, it provides a robust framework for handling both ETL and ELT workflows with a focus on automation, reliability, and developer experience.

981 1 0
sethupavan12
llm-markdownify

Convert documents, images to high-quality Markdown using Vision LLMs. Built for RAG ingestion pipelines.

494 20 1
RobotStudio
bors

A highly flexible and versatile service integration framework.

364 2 0
zacernst
nanostream

Small-scale stream processing for ETL

317 1 0
paloaltodatabases
sequor

SQL-centric API integration platform

313 87 2
zacernst
metalpipe

Modules for ETL Pipelines

244 1 0
Rudra-K
feedunify

A high-performance, unifying library for data ingestion pipelines from multiple sources.

205 3 0
daq-tools
skeem

Infer SQL DDL statements from tabular data

159 3 1
knifflig
sdmxflow

SDMX ingestion for statistical data warehouses: reproducible datasets, metadata history, and exported codelists in an append-only structure.

120 2 0
avasis-ai
ragpipe-ai

RAG in 3 functions. Ingest any data source into vector databases.

105 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery