PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Retrieval Python Packages

Python packages with the GitHub topic retrieval. Sorted by relevance, with stars and monthly downloads.
qdrant
fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

13.3M 3K 199
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.8M 3K 614
xhluca
bm25s

Fast BM25 search in Python, powered by Numpy and Numba

1.5M 2K 99
MinishLab
semble

Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read

58K 827 66
beir-cellar
beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

47K 2K 244
VectifyAI
pageindex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

45K 31K 3K
qdrant
fastembed-gpu

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

19K 3K 199
ContextualAI
gritlm

Generative Representational Instruction Tuning

15K 691 50
meinardmueller
libfmp

libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)

13K 224 20
robotrocketscience
aelfrice

Bayesian memory that learns from feedback for LLM agents

13K 5 1
ARM-DOE
act-atmos

Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets

12K 185 41
mixedbread-ai
mxbai-rerank

Crispy reranking models by Mixedbread

10K 51 7
jaytoone
ctx-retriever

Trigger-Driven Dynamic Context Loading for Code-Aware LLM Agents

9K 5 2
usemoss
inferedge-moss

The retrieval layer for production AI systems. Lightning-fast (<10ms) search without vector databases. Built for browser, edge, on-device, and cloud.

9K 374 36
roomi-fields
rtfm-ai

The open retrieval layer for AI coding agents. Indexes code, docs, legal, research, data — 22 parsers (incl. EPUB, DOCX, ODT), FTS5 + semantic search, knowledge graph. Serves surgical context via MCP. Open source, local, free.

8K 10 2
VectifyAI
openkb

OpenKB: Open LLM Knowledge Base

7K 2K 196
ben-ranford
cellin

Build long-lived multimodal memory, dream over it, and retrieve context with transparent weighting.

7K 0 0
xhluca
bm25

Fast BM25 search in Python, powered by Numpy and Numba

5K 2K 99
intel
intel-extension-for-transformers

âš¡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsâš¡

5K 2K 217
bicardinal
brinicle

A resource-efficient C++ vector index engine built for low-RAM production workloads

5K 11 0
answerdotai
byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

4K 847 93
illuin-tech
vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

4K 272 35
memodb-io
memobase

User Profile-Based Long-Term Memory for AI Chatbot Applications.

4K 3K 213
lucidrains
memorizing-transformers-pytorch

Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch

4K 644 47
    • Data from PyPI, GitHub, ClickHouse, and BigQuery