PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Hocr Python Packages

Python packages with the GitHub topic hocr. Sorted by relevance, with stars and monthly downloads.
kreuzberg-dev
html-to-markdown

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

494K 710 57
stefan6419846
hocr-tools-lib

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

475 4 1
BlueBox-WorldWide
textract-hocr

Convert AWS Textract JSON output to hOCR format

423 0 0
brunomacabeusbr
pyslibtesseract

✏️ Integration of Tesseract for Python using a shared library

206 12 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery