PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Document Classification Python Packages

Python packages with the GitHub topic document-classification. Sorted by relevance, with stars and monthly downloads.
yuvaraj3855
preocr

Fast document classification and OCR detection. Analyzes any file type to determine if OCR is needed, saving time and money on unnecessary processing.

3K 10 4
sergioburdisso
pyss3

A Python library for Interpretable Machine Learning in Text Classification using the SS3 model, with easy-to-use visualization tools for Explainable AI :octocat:

1K 348 44
jfilter
text-classification-keras

Text Classification Library for Keras

859 53 10
ale-grassi
riordino

Intelligent scanned PDF organizer — splits bulk scans into separate, well-named documents using AI

683 1 0
raviqqe
tensorflow-font2char2word2sent2doc

TensorFlow implementation of Hierarchical Attention Networks for Document Classification

635 94 31
sbischoff-ai
document-classifier

A simple CNN for n-class classification of document images

594 2 0
DocsaidLab
docclassifier-docsaid

A zero-shot document classifier.

379 5 1
acsenrafilho
cucaracha

Mr. Franz Cucaracha will be glad to assist you to the document analysis and processing routine

345 1 1
GuillaumeDD
gowpy

A very simple library for exploiting graph-of-words in NLP

172 12 2
hank110
bagofconcepts

This is python implementation of Bag-of-Concepts, as proposed by the paper "Bag-of-Concepts: Comprehending Document Representation through Clustering Words in Distributed Representation"

155 20 1
kk7nc
hdltex

HDLTex: Hierarchical Deep Learning for Text Classification

148 277 66
docuglean-ai
docuglean-ocr

Intelligent document processing. Extract structured data like JSON, Markdown and HTML from documents using AI.

135 115 2
docuglean-ai
docuglean

An SDK for intelligent document processing using SOTA VLLM models

95 115 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery