PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Semantic Search Python Packages

Python packages with the GitHub topic semantic-search. Sorted by relevance, with stars and monthly downloads.
lancedb
lancedb

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

7.9M 10K 881
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.8M 3K 614
deepset-ai
haystack-ai

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

893K 25K 3K
unum-cloud
usearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

576K 4K 316
PrithivirajDamodaran
flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

554K 971 69
zilliztech
gptcache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

471K 8K 583
docarray
docarray

Represent, send, store and search multimodal data

129K 3K 243
deepset-ai
farm-haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

73K 25K 3K
neuml
txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

43K 13K 829
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

37K 35K 2K
tobocop2
lilbee

Terminal-first local search and AI chat over your documents, code, and crawled websites. Semantic + hybrid search, vision OCR, auto-built wiki, browsable GGUF model catalog. Works as CLI, TUI, MCP server, REST API, or Python library. Offline by default, no sidecar services.

37K 18 3
weaviate
weaviate-cli

CLI tool for Weaviate

22K 31 19
sysid
bkmr

Knowledge Management for Humans and Agents

21K 251 10
knowusuboaky
vectrixdb

Where vectors come alive - A lightweight, visual-first vector database with embedded ML models. No API keys required.

20K 2 0
alibaba
zvec

A lightweight, lightning-fast, in-process vector database

19K 10K 550
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

18K 35K 2K
unum-cloud
usearch-iscc

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

17K 4K 316
54yyyu
zotero-mcp-server

Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

15K 3K 282
srclight
srclight

Deep code indexing MCP server for AI agents. 25 tools: hybrid FTS5 + embedding search, call graphs, git blame/hotspots, build system analysis. Multi-repo workspaces, GPU-accelerated semantic search, 10 languages via tree-sitter. Fully local, zero cloud dependencies.

15K 41 8
alexklibisz
elastiknn-client

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

15K 392 50
chonkie-inc
catsu

🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱

14K 67 3
ywatanabe1989
openalex-local

Local OpenAlex database (284M+ works, FTS5 search, abstracts, SciTeX IF) with Python API, CLI, HTTP relay, and MCP server.

14K 1 0
qualixar
superlocalmemory

World's first local-only AI memory to break 74% retrieval and 60% zero-LLM on LoCoMo. No cloud, no APIs, no data leaves your machine. Additionally, mode C (LLM/Cloud) - 87.7% LoCoMo. Research-backed. arXiv: 2603.14588

11K 150 18
memtomem
memtomem

Markdown-first, long-term memory infrastructure for AI agents. Hybrid BM25 + semantic search across markdown/code files via MCP.

10K 4 23
    • Data from PyPI, GitHub, ClickHouse, and BigQuery