retrieval-systems
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
Lexical routing layer for LLM tool selection. Filter MCP-discovered and registry tools before prompt assembly using fast BM25S retrieval.
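As a rough illustration of the lexical-routing idea described above, here is a minimal pure-Python sketch that ranks tool descriptions against a query with the standard Okapi BM25 formula and keeps only the top matches. It does not use the BM25S library or the project's own API. The names `tokenize`, `bm25_rank`, `select_tools`, the whitespace tokenizer, and the example tool registry are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Return (doc_index, score) pairs sorted by descending BM25 score."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    if n == 0:
        return []
    # Average document length; fall back to 1.0 if every document is empty.
    avgdl = sum(len(t) for t in tokenized) / n or 1.0
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    query_terms = set(tokenize(query))
    scores = []
    for i, toks in enumerate(tokenized):
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append((i, score))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


def select_tools(query, tools, top_k=2):
    """Keep at most top_k tool names whose description matches the query."""
    names = list(tools)
    ranked = bm25_rank(query, [tools[name] for name in names])
    return [names[i] for i, score in ranked[:top_k] if score > 0]


# Hypothetical tool registry: name -> description.
TOOLS = {
    "weather": "get current weather forecast for a city",
    "search_files": "search files on disk by name",
    "send_email": "send an email message to a recipient",
}

selected = select_tools("what is the weather forecast in Paris", TOOLS)
```

Only the tools returned by `select_tools` would then be included in the prompt, which keeps prompt size bounded as the registry grows.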
[PyTorch] Efficient tokenization library for recommendations and generative retrieval using pre-trained language models. Inspired by "Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators".
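To give a sense of what trie-constrained decoding means in generative retrieval, the following sketch builds a plain dictionary trie over valid item identifiers and restricts greedy decoding to tokens that extend a valid prefix. It is not the vectorized, accelerator-friendly method from the cited paper, nor the library's API. The function names, the toy scoring function, and the assumption that no identifier is a prefix of another (for example, fixed-length identifiers) are all illustrative.

```python
def build_trie(sequences):
    """Build a nested-dict trie from valid token-ID sequences."""
    root = {}
    for seq in sequences:
        node = root
        for tok in seq:
            node = node.setdefault(tok, {})
    return root


def constrained_greedy_decode(score_fn, trie):
    """Greedy decoding that only picks tokens allowed by the trie.

    score_fn(prefix) is a hypothetical stand-in for a language model: it
    returns a dict mapping token ID -> score for the next position.
    Assumes no valid sequence is a prefix of another, so decoding stops
    when a leaf (empty dict) is reached.
    """
    prefix, node = [], trie
    while node:
        scores = score_fn(prefix)
        # Tokens outside the trie are never considered, however high they score.
        best = max(node, key=lambda tok: scores.get(tok, float("-inf")))
        prefix.append(best)
        node = node[best]
    return prefix


# Hypothetical catalogue of item identifiers (token-ID sequences).
VALID_IDS = [[1, 2, 3], [1, 4, 5], [6, 7, 8]]
TRIE = build_trie(VALID_IDS)


def toy_scores(prefix):
    # Toy model: fixed scores regardless of prefix; token 9 is not a valid choice.
    return {9: 10.0, 4: 5.0, 1: 3.0, 6: 2.0, 5: 1.0, 2: 0.5, 3: 0.1}


decoded = constrained_greedy_decode(toy_scores, TRIE)
```

Even though the toy model scores token 9 highest at every step, the decoded sequence is always one of the catalogue identifiers, which is the guarantee that constrained decoding is meant to provide.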