PyRank

Docling Python Packages

Python packages with the GitHub topic docling. Sorted by relevance, with stars and monthly downloads.
felixdittrich92
docling-ocr-onnxtr

OnnxTR OCR plugin for Docling

33K 20 0
NameetP
pdfmux

PDF extraction that checks its own work. #2 reading order accuracy — zero AI, zero GPU, zero cost.

3K 63 7
docling-project
docling-graph

Transform unstructured documents into validated, rich and queryable knowledge graphs.

3K 149 23
versionHQ
versionhq

Autonomous agent networks for task automation that requires multi-step reasoning

2K 30 10
tiroq
mdify-cli

Convert PDFs and document images into structured Markdown for LLM workflows

2K 0 0
ENDEVSOLS
longparser

Privacy-first document intelligence engine — parse PDFs, DOCX, PPTX, XLSX & CSV into AI-ready chunks for RAG pipelines. Includes HITL review, 3-layer memory chat, and a production FastAPI server.

1K 26 2
stevereiner
flexible-graphrag-mcp

Python, LlamaIndex, LangChain, Docker Compose: 15 Property Graph, 4 RDF, 10 Vector, OpenSearch, Elasticsearch, Alfresco DBs. 13 data sources (9 auto-sync), KG auto-building, Ontologies, LLMs, Docling or LlamaParse doc processing, GraphRAG, RAG only, Hybrid Search, AI Chat. TypeScript React, Vue, Angular frontends, FastAPI REST backend, MCP Server.

1K 127 28
stevereiner
flexible-graphrag

Python, LlamaIndex, LangChain, Docker Compose: 15 Property Graph, 4 RDF, 10 Vector, OpenSearch, Elasticsearch, Alfresco DBs. 13 data sources (9 auto-sync), KG auto-building, Ontologies, LLMs, Docling or LlamaParse doc processing, GraphRAG, RAG only, Hybrid Search, AI Chat. TypeScript React, Vue, Angular frontends, FastAPI REST backend, MCP Server.

1K 127 28
DCC-BS
docling-glm-ocr

A Docling plugin that integrates a remotely hosted GLM-OCR model into Docling

919 11 1
jspast
cells2table

Table image parsing with cell detection models

910 1 0
shoryasethia
markdrop

A Python package for converting PDFs to Markdown while extracting images and tables, and for generating text descriptions of the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.

700 204 18
aspose-cells-foss
aspose-cells-foss

A Python library for creating, reading, and modifying Excel files (.xlsx format)

616 9 0
DCC-BS
docling-pp-doc-layout

A Docling plugin for document layout detection with the PaddlePaddle PP-DocLayout-V3 model.

405 5 0
ghodsizadeh
pdf2csv

A Python library and CLI tool to convert PDF files to CSV files.

359 42 5
aspose-cells-foss
aspose-cells-foss-for-python

A Python library for creating, reading, and modifying Excel files (.xlsx format)

357 9 0
Kubenew
ragpipe-lite

ragpipe-lite: unified RAG ingestion pipeline (loaders, chunking, embeddings, vector store export).

355 1 0
aksarav
pdfstract

PDFStract - The Extraction and Chunking Layer in Your RAG Pipeline - Available as CLI - WEBUI - API

321 147 12
Sinapsis-AI
sinapsis-docling

Package to perform document conversion using Docling

217 0 0
Data from PyPI, GitHub, ClickHouse, and BigQuery