PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Pdf Converter Python Packages

Python packages with the GitHub topic pdf-converter. Sorted by relevance, with stars and monthly downloads.
docling-project
docling

Get your documents ready for gen AI

7.2M 60K 4K
xhtml2pdf
xhtml2pdf

A library for converting HTML into PDFs using ReportLab

3.9M 2K 656
docling-project
docling-slim

Get your documents ready for gen AI

1.1M 60K 4K
borb-pdf
borb

borb is a library for reading, creating and manipulating PDF files in python.

500K 4K 158
opendatalab
mineru

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

305K 64K 5K
opendataloader-project
opendataloader-pdf

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

134K 21K 2K
opendatalab
magic-pdf

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

76K 64K 5K
abarker
pdfcropmargins

pdfCropMargins -- a program to crop the margins of PDF files

26K 470 41
miikanissi
zebrafy

Python library for converting PDF and images to and from Zebra Programming Language (ZPL).

22K 77 11
aspose-pdf
aspose-pdf

Aspose.PDF for Python via .NET examples and showcase projects

18K 7 0
DS4SD
deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

7K 227 32
explosion
spacy-layout

📚 Process PDFs, Word documents and more with spaCy

5K 901 64
opendatalab
mineru-selfhosted-mcp

MCP bridge for a self-hosted MinerU API

5K 64K 5K
raphaelmansuy
edgeparse

EdgeParse converts any digital PDF into Markdown, JSON (with bounding boxes), HTML, or plain text — deterministically, without a JVM, without a GPU, and with best-in-class accuracy on the 200-document benchmark suite included in this repository.

5K 109 13
opendataloader-project
langchain-opendataloader-pdf

A LangChain integration for OpenDataLoader PDF

3K 33 4
benjamin-awd
monopoly-core

Monopoly is a Python library & CLI that converts bank statement PDFs to CSV

2K 172 46
pankajr141
pdf2jpg

Utility to convert PDF into JPG files

2K 58 22
Hugues-DTANKOUO
olgadoc

Four formats. One engine. PDF, DOCX, XLSX, HTML → Markdown and typed JSON, 15–40× faster than equivalent-quality OSS. Rust core with strictly-typed Python bindings.

1K 8 0
ashutoshvarma
pyxpdf

Fast and memory-efficient Python PDF Parser based on xpdf sources

1K 44 17
moria97
fastpdf4llm

Lightweight and fast library to convert PDF to markdown format.

1K 1 0
vpoulailleau
md-to-pdf

Yet another Markdown to PDF converter

767 2 0
gastongouron
ironpress

Pure Rust PDF converter, no browser, no external dependencies. Supports HTML with inline CSS, Markdown, and document conversion with a built-in layout engine.

710 195 10
benjamin-awd
monopoly-sg

PDF parsing for Singaporean banks

704 172 46
OthmaneBlial
pdf-editor-offline

PDF Editor Offline: A powerful open-source Free PDF editor that runs 100% offline, ensuring complete privacy and zero cost. Edit, convert, merge, split, compress, organize, and secure PDFs directly on your machine—no cloud uploads, no subscriptions, no accounts. Fully featured with annotations, OCR, batch processing, broad format conversion.

681 4 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery