PyRank

Big Data Processing Python Packages

Python packages with the GitHub topic big-data-processing. Sorted by relevance, with stars and monthly downloads.
souvik-databricks
dlt-with-debug

A lightweight helper utility that lets developers build pipelines interactively by keeping a single source code base for both DLT runs and non-DLT interactive notebook runs.

19K 50 9
IncredibleProgress
sweetheart

rock-solid pillars for enterprise-grade solutions

601 2 0
impresso
impresso-text-preparation

🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

556 9 3
impresso
impresso-text-importer

🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.

311 9 3
akovner
jsv

A compact representation of bulk JSON objects.

78 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery