text-splitting
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
High-fidelity, context-aware chunking and interactive visualization for RAG. Advanced segmentation for code and documents, because your LLM is only as smart as the fragments you feed it.
Benchmark chunking strategies for your RAG corpus and get a recommended config. Available as a CLI, Python library, and MCP server.
A smart multilingual text chunker for LLMs, RAG, and beyond.