Long Context Python Packages

megabyte-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch

4K 655 55

recurrent-memory-transformer-pytorch

Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch

4K 424 19

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch

4K 546 36

infini-transformer-pytorch

Implementation of Infini-Transformer in PyTorch

3K 112 4

nexusquant-kv

Training-free KV cache compression via E8 lattice quantization; fits longer context in the same VRAM

3K 20 0

kakeyalattice

Discrete Kakeya cover for LLM KV cache: D4/E8 nested-lattice quantisation realising a Kakeya-style tube-cover over the direction sphere. 2.4x-2.8x compression at <1% perplexity loss on Qwen3, Llama-3, DeepSeek, GLM-4, Gemma. Drop-in transformers.DynamicCache. pip install kakeyalattice.

1K 9 2

perceiver-ar-pytorch

Implementation of Perceiver AR, DeepMind's new long-context attention network based on the Perceiver architecture, in PyTorch

720 95 4

tokenpack-rag

Query-aware semantic chunk selection under LLM context-window budgets.

440 10 0

titans-memory

PyTorch implementation of Titans: Learning to Memorize at Test Time (Behrouz, Zhong & Mirrokni, 2024)

248 1 2

mcp-long-context-reader

A tool to help agents read and query long documents.

230 5 1

ringattention

RingAttention for Transformers with Arbitrarily Large Context.

219 773 53

compactbench

Open benchmark for LLM context compaction methods — measures what survives when you replace conversation history with a compacted artifact. Multi-cycle drift, hidden ranked set.

183 2 0