PyRank

Retrieval Augmented Generation Python Packages

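Python packages tagged with the GitHub topic retrieval-augmented-generation, sorted by relevance. Each entry lists the GitHub owner, the package name, its description, and a row of counts that includes monthly downloads and GitHub stars.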
qdrant
fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

13.3M 3K 199
redis
redisvl

Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

2.2M 400 84
deepset-ai
haystack-ai

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

893K 25K 3K
PrithivirajDamodaran
flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

554K 971 69
FlagOpen
flagembedding

Retrieval and Retrieval-augmented LLMs

529K 12K 876
oeken
needle-python

Needle simplifies building RAG pipelines.

238K 45 2
datastax
langchain-graph-retriever

Graph traversal for improved RAG

226K 87 28
datastax
graph-retriever

Graph traversal for improved RAG

226K 87 28
HKUDS
lightrag-hku

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

211K 35K 5K
Clarifai
clarifai

Experience the power of Clarifai’s AI platform with the Python SDK. 🌟 Star to support our work!

152K 44 8
illuin-tech
colpali-engine

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

149K 3K 251
Blaizzy
mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.

75K 384 48
deepset-ai
farm-haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

73K 25K 3K
VectifyAI
pageindex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

45K 31K 3K
neuml
txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

43K 13K 829
awslabs
cdklabs-generative-ai-cdk-constructs

AWS Generative AI CDK Constructs are sample implementations of AWS CDK for common generative AI patterns.

41K 535 75
HKUDS
raganything

"RAG-Anything: All-in-One RAG Framework"

33K 20K 2K
NVIDIA-AI-Blueprints
nvidia-rag

This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.

32K 619 263
cognitx-leyton
cognitx-codegraph

🕸️ Code knowledge graph for Claude Code & AI coding agents — index TypeScript, NestJS, React into Neo4j and query architecture in Cypher

28K 8 0
The-Pocket
pocketflow

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

27K 11K 1K
LearningCircuit
local-deep-research

~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & Encrypted.

24K 8K 670
qdrant
fastembed-gpu

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

19K 3K 199
biocypher
biocypher

A unifying framework for biomedical research knowledge graphs

12K 304 51
mixedbread-ai
mxbai-rerank

Crispy reranking models by Mixedbread

10K 51 7
Data from PyPI, GitHub, ClickHouse, and BigQuery