PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Document Understanding Python Packages

Python packages with the GitHub topic document-understanding. Sorted by relevance, with stars and monthly downloads.
deepdoctection
deepdoctection

A Repo For Document AI

8K 3K 191
deepdoctection
dd-core

A Repo For Document AI

4K 3K 191
yuvaraj3855
preocr

Fast document classification and OCR detection. Analyzes any file type to determine if OCR is needed, saving time and money on unnecessary processing.

3K 10 4
deepdoctection
dd-datasets

A Repo For Document AI

3K 3K 191
AI4WA
docs2synth

A Python package for synthesizing and working with document data.

313 2 1
huggingface
chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

179 161 10
marimo-marine23
xlmelt

Convert complex Excel files into AI-readable JSON/HTML

78 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery