PyRank

ocr-python Python Packages

Python packages with the GitHub topic ocr-python. Sorted by relevance, with stars and monthly downloads.
breezedeus
cnocr

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [A Chinese/English OCR Python package based on PyTorch/MXNet.]

74K 4K 538
ankandrew
fast-plate-ocr

Lightweight & fast OCR models for license plate text recognition.

24K 563 72
StabRise
scaledp

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

3K 18 1
SakuraMathcraft
mathcraft-ocr

A Windows math workspace for screenshot OCR, handwriting-to-LaTeX, editing, preview, and symbolic computation, powered by MathCraft OCR and MathLive.

3K 167 15
LATIS-DocumentAI-Group
documentai-std

The main standards for Latis Document AI project

2K 3 0
Navaneeth-Sharma
aksharajaana

A Kannada OCR

2K 10 3
gnana70
ocr-tamil

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

989 88 16
pk5ls20
easypaddleocr

A simple package for PaddleOCR on CPU and GPU using PyTorch

909 13 2
shibing624
imgocr

Python3 package for Chinese/English OCR that uses the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference is based on the ppocr-v5-onnx model: open-source SOTA for Chinese/English OCR, with very fast inference.

836 132 21
maxent-ai
ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

564 226 11
sethupavan12
llm-markdownify

Convert documents, images to high-quality Markdown using Vision LLMs. Built for RAG ingestion pipelines.

502 20 1
Anish-M-code
pdftotext3

A simple pdftotext conversion tool for Windows 8.1/10/11 and Fedora/Ubuntu/Debian/Arch-based Linux distros using poppler-utils and Google's tesseract-ocr.

469 22 2
Danielnara24
mistral-ocr-gui

A Python package with a graphical user interface for processing images with the Mistral OCR API

406 1 0
FREDERICO23
docling-ocr

A powerful Python package for extracting text from images and documents using the SmolDocling-256M-preview advanced LLM-based models.

344 12 1
PSPDFKit
nutrient-dws

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

273 54 1
sxaxmz
handle-scanned-pdf

No description available

267 0 1
VerisimilitudeX
ocr-pdf2txt

Use Optical Character Recognition technology to convert scanned PDFs into TXT files locally.

207 1 0
AbsoluteWinter
vocr

Vietnamese OCR

203 0 0
kfur
fineocr

Free OCR that uses the FineReader (previously FineScanner) Mobile API, due to hardened password

200 2 0
tjkessler
tesseract-positional

Tool to save positional OCR data to a text file

126 0 0
jcspeegs
loups

Extract video chapter timestamps and title screens using template matching and OCR - perfect for sports, podcasts, and content creation

118 0 0
snakers4
silero-ocr

Simple optical character recognition (OCR) by Silero

109 0 0
lollococce
pdfer

A Python library to handle the transformation from PDFs to data

106 1 0
sergiocorreia
quipucamayoc

Tools to extract information from digitized historical documents

98 33 5
    • Data from PyPI, GitHub, ClickHouse, and BigQuery