PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Rag Pipeline Python Packages

Python packages with the GitHub topic rag-pipeline. Sorted by relevance, with stars and monthly downloads.
davidpirogov
toon-llm

Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization format implemented in Python.

26K 9 3
Project-Navi
navi-sanitize

Deterministic input sanitization for untrusted text — invisible characters, homoglyphs, and encoding tricks, handled before your code sees them. Zero dependencies, no ML. Python 3.12+.

17K 2 0
JonathanBerhe
gvdb

High-performance distributed vector database with high-dimensional vector support

8K 0 0
glemiu6
pyragcore

A reusable, modular RAG (Retrieval-Augmented Generation) core library built on FAISS and Ollama

6K 0 0
superagentxai
superagentx

Move from idea to production in hours with policy-driven autonomous AI agents. Unified Control Plane: Centralised tools, MCPs, models, data, and policies with consistent observability and governance.

4K 197 43
project-david-ai
projectdavid-platform

Deployment orchestrator for the Project David / Entities platform

3K 1 0
SynapseKit
synapsekit

Minimal, async-first Python framework for production LLM apps- 2 hard deps, no magic, no SaaS.

3K 19 19
kaiserkonok
keovil

Private Query Interface for Documents & Structured Data

2K 3 0
nanonets
nanoindex

Agentic RAG Harness for long documents, Tree and Graph based reasoning. Cited answers down to the pixel

2K 55 5
ddickmann
latence-solver

High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.

2K 16 1
superagentxai
superagentx-handlers

Move from idea to production in hours with policy-driven autonomous AI agents. Unified Control Plane: Centralised tools, MCPs, models, data, and policies with consistent observability and governance.

1K 197 43
SwiftWing21
helix-context

Agent knowledge index — IDF-weighted retrieval with deterministic SLM embedded models. SQLite, 17K+ entries on consumer hardware at 5x compression. Know-vs-Go

1K 6 0
vrraj
vrraj-bm25s-retriever

Lexical routing layer for LLM tool selection. Filter MCP-discovered and registry tools before prompt assembly using fast BM25S retrieval.

1K 1 0
ddickmann
voyager-index

High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.

1K 16 1
laxmimerit
ragwire

Production-grade RAG toolkit — ingest PDFs, DOCX, XLSX into Qdrant with LLM metadata extraction, hybrid search, and SHA256 deduplication.

1K 17 5
vunone
ennoia

Declarative Document Indexing (DDI) Schemas for RAG — LLM-powered pre-indexing and hybrid retrieval.

920 43 4
hallengray
rag-forge-core

Production-grade RAG pipelines with evaluation baked in

646 7 0
NetApp
netapp-aide-mcp

MCP server for NetApp AI Data Engine

611 0 0
hallengray
rag-forge-evaluator

Production-grade RAG pipelines with evaluation baked in

608 7 0
hallengray
rag-forge-observability

Production-grade RAG pipelines with evaluation baked in

597 7 0
vrraj
vrraj-llm-adapter

Provider-agnostic, registry-driven LLM adapter for text generation and embeddings with normalized outputs - includes an interactive test UI.

561 1 0
sanonone
kektordb-client

AI memory system combining vector search with temporal knowledge graph. Built-in cognitive engine for agents. Supports memory decay, contradiction detection, and MCP integration.

559 73 7
AnubhavChoudhery
cybersec-scanner

A comprehensive security scanner and RAG-based vulnerability analyzer

416 2 1
rodmena-limited
ragit

Correct complete RAG -- built for Highway Workflow Engine

416 4 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery