PyRank

PDF Document Python Packages

Python packages with the GitHub topic pdf-document. Sorted by relevance, with stars and monthly downloads.
chinapandaman
pypdfform

The Python library for PDF forms.

147K 1K 67
Krasjet
pdf-tocgen

A CLI toolset to generate table of contents for PDF files automatically.

2K 831 28
ArlindNocaj
docbarcodes

Docbarcodes extracts 1D and 2D barcodes from scanned PDF documents or images. It can be used to automate the extraction and processing of all kinds of documents.

2K 4 1
sameerkumar18
pdfgeneratorapi

PDFGeneratorAPI Python Wrapper

2K 6 1
OthmaneBlial
pdf-editor-offline

PDF Editor Offline: a powerful, free, open-source PDF editor that runs 100% offline, ensuring complete privacy and zero cost. Edit, convert, merge, split, compress, organize, and secure PDFs directly on your machine, with no cloud uploads, subscriptions, or accounts. Fully featured with annotations, OCR, batch processing, and broad format conversion.

696 4 0
mcagriaksoy
safepdf

SafePDF is a privacy-focused offline tool for PDF manipulation. Merge, compress, split, and organize your PDF files securely: No internet required, your documents stay local and safe.

465 6 1
JustinTheWhale
pdfdarkmode

Converts PDFs to have a grey background that is easier on the eyes.

453 17 6
openfun
django-marion-howard

FUN documents for Marion, the documents factory

427 17 0
digidigital
coverup-pdf

A tool for redacting PDF files and images

300 51 11
Magnet-AI
quanta-pdf

Advanced PDF layout analysis engine for extracting figures, tables, and structured content from complex engineering documents using computer vision and machine learning.

270 2 1
StabRise
pyspark-pdf

PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them.

243 81 4
Kubenew
pdf2struct

`pdf2struct` extracts structured JSON from PDF documents.

195 1 0
eli64s
pdflex

CLI for merging PDF contexts.

183 3 1
huggingface
chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

176 161 10
Elagoht
img2pdf-plus

Merge images into one PDF file from the command line, with useful options.

122 3 0
lollococce
pdfer

A Python library to handle the transformation from PDFs to data

99 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery