Llm Cost Reduction Python Packages

mnemon-ai

Biological nervous systems don't recompute known workflows from scratch. Mnemon gives LLM agents the same primitive — execution memory that caches plans, not responses. 93% token reduction, 2.66ms vs 20s, zero tokens on repeat runs. LangChain, CrewAI, AutoGen.

2K 3 2

claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

992 2K 209

supercompress

SuperCompress — learned context compression for LLMs.

403 2 0