PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Optical Character Recognition Python Packages

Python packages with the GitHub topic optical-character-recognition. Sorted by relevance, with stars and monthly downloads.
jaidedai
easyocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

3M 29K 4K
sirfz
tesserocr

A Python wrapper for the tesseract-ocr API

367K 2K 259
mindee
python-doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

318K 6K 644
felixdittrich92
onnxtr

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

93K 180 18
amenezes
aiopytesseract

asyncio tesseract wrapper for Tesseract-OCR

2K 27 7
gnana70
ocr-tamil

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

989 88 16
by256
imagedataextractor

ImageDataExtractor 2.0 - a Python library for electron microscopy image quantification.

956 21 2
caltechlibrary
handprint

Apply different text recognition services to images of handwritten documents.

660 189 18
OmarSamirz
iftg

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.

481 21 2
acsenrafilho
cucaracha

Mr. Franz Cucaracha will be glad to assist you to the document analysis and processing routine

345 1 1
bandrel
ocyara

Performs OCR on image files and scans them for matches to YARA rules

324 42 8
JaidedAI
easyocr-itgn

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

267 29K 4K
verifid
mocr

Meaningful Optical Character Recognition from identity cards with Deep Learning.

265 25 6
jaidedai
nocv2easyocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

251 29K 4K
MartinThoma
hasy

Tools for the HASY dataset.

226 36 11
olaflaitinen
thulium-htr

Thulium - State-of-the-Art Multilingual Handwriting Text Recognition for Python

215 8 0
khasbilegt
numiner

MNIST like dataset creation tool for Handwritten Text Recognition.

210 4 0
jaidedai
asone-ocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

189 29K 4K
jaidedai
myeasyocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

163 29K 4K
cneud
alto-tools

Python tools for performing various operations on ALTO XML files

158 49 15
marieai
marie-ai

Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications, supporting various tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing

126 89 11
snakers4
silero-ocr

Simple optical character recognition (OCR) by Silero

109 0 0
jaidedai
axcelocr

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

107 29K 4K
18520339
tfseqrec

TensorFlow 2 Toolkit for Sequence-level Text Recognition that simplifies the process of importing, handling, and visualizing sequence data, as well as providing most used loss functions and evaluation metrics in the development of Sequence Text Recognition models

106 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery