PyRank

Document Image Processing Python Packages

Python packages with the GitHub topic document-image-processing. Sorted by relevance, with stars and monthly downloads.
Unstructured-IO
unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

5.4M 15K 1K
Layout-Parser
layoutparser

A Unified Toolkit for Deep Learning Based Document Image Analysis

170K 6K 534
Unstructured-IO
unstructured-cpu

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

3K 15K 1K
jchazalon
smartdoc15-ch1

Python wrapper to facilitate data manipulation for the SmartDoc 2015 - Challenge 1 Dataset.

185 7 2
Data from PyPI, GitHub, ClickHouse, and BigQuery